EDITORIAL


When facts are not enough 


Science is based on a shared respect for the scientific method: the principle that, by gathering and analyzing data and information, scientists and others can draw conclusions that are robust and generalizable across cultures and ideologies. Scientists furthermore assume that disagreements can be resolved by more facts. So when people object to the reality of climate change with science-y sounding arguments (“the data is wrong,” or “it’s just a natural cycle,” or even, “we need to study it longer”), the natural response of scientists is simple and direct: People need more data. But this approach often doesn’t work and can even backfire. Why? Because when it comes to climate change, science-y sounding objections are a mere smokescreen to hide the real reasons, which have much more to do with identity and ideology than data and facts.

For years, climate change has been one of the most politically polarized issues in the United States. Today, the best predictor of whether the public agrees with the reality of anthropogenic climate change is not how much scientific information there is. It’s where each person falls on the political spectrum. That’s why the approach of bombarding the unconvinced with more data doesn’t work: people see it as an attack on their identity and an attempt to change their way of life.

I am a climate scientist who has spent a lot of time trying to make climate science more accessible. I’ve authored National Climate Assessments and numerous outreach reports; I host a YouTube show called Global Weirding; I tweet; I’ve even promoted knitting patterns that display rising temperatures. Yet the most important step I’ve taken to make my science communication more effective has nothing to do with the science. As uncomfortable as this is for a scientist in today’s world, the most effective thing I’ve done is to let people know that I am a Christian. Why? Because it’s essential to connect the impacts of a changing climate directly to what’s already meaningful in one’s life, and for many people, faith is central to who they are. Scientists can be effective communicators by bonding over a value that they genuinely share with the people with whom they’re speaking. It doesn’t have to be a shared faith. It could be that both are parents, or live in the same place, or are concerned about water resources or national security, or enjoy the same outdoor activities. Instead of beginning with what most divides scientists from others, start the conversation from a place of agreement and mutual respect. Then, scientists can connect the dots: share from their head and heart why they care.

Talking about impacts isn’t enough, though. Sadly, the most dangerous myth that many people have bought into is, “it doesn’t matter to me,” and the second most dangerous myth is, “there’s nothing I can do about it.” If scientists describe the daunting challenge of climate change but can’t offer an engaging solution, then people’s natural defense mechanism is to disassociate from the reality of the problem. That’s why changing minds also requires providing practical, viable, and attractive solutions that someone can get excited about. Concerned homeowner? Mention the amazing benefits of energy conservation. Worried parent? Bring up the practical steps to take to make outdoor play spaces safer for kids, even in the hot summer. Business executive? Talk about the economic benefits of renewables.

We all live on the same planet, and we all want the same things. By connecting our heads to our hearts, we all can talk about, and tackle, the problem of climate change together.

-Katharine Hayhoe

“...changing minds...requires...practical, viable, and attractive solutions that someone can get excited about.”

Katharine Hayhoe is a professor and director of the Climate Science Center at Texas Tech University, Lubbock, TX, USA. www.katharinehayhoe.com

10.1126/science.aau2565

Senator Sheldon Whitehouse (D-RI), on ex-Representative Jim Bridenstine (R-OK), now NASA
administrator, who reversed himself and called humans the top cause of climate change.


Edited by Jeffrey Brainard 


Mars rover’s drill revived 




Scientists will use the drill to continue searching for signs of ancient life on Mars. 


Some 18 months after it first failed, the drill on NASA’s Curiosity rover is once again sampling rocks on Mars, the agency announced last week. Since December 2016, the drill on the SUV-size rover, located at the end of a robotic arm, had been unable to reliably penetrate rock after first being stabilized against its target. Engineers at the Jet Propulsion Laboratory in Pasadena, California, have now tested a technique that forgoes stabilization, instead using the robotic arm to plunge the drill into rock. In May, the drill penetrated 50 millimeters into a veiny rock in a clay-rich formation. Scientists hope the revived drill will allow Curiosity to resume its chase for molecular evidence of ancient martian life using its most precise measure: mixing samples with thimbles of solvent, a technique it had yet to deploy when the drill failed (Science, 3 February 2017, p. 444).


Study charts health care gains


PUBLIC HEALTH | Cambodia, Equatorial 
Guinea, Ethiopia, Myanmar, and Rwanda 
were among the countries with the big- 
gest gains in access to quality health care 
from 2000 to 2016. So says a new study 
published online in The Lancet this week 
by a massive group of researchers known 
as the Global Burden of Diseases group, 
led by the Institute for Health Metrics and 
Evaluation at the University of Washington 
in Seattle. The new assessment is based 
on mortality figures for 32 diseases that, 
if properly treated, should not be fatal. 
Most lower-income nations saw gains 
driven by expanded vaccination programs. 
Not surprisingly, nations with the high- 
est quality overall were Iceland, Norway, 
and the Netherlands, whereas the much 
poorer Central African Republic, Somalia, 
and Guinea-Bissau were at the bottom
of the list. Large countries such as India
and China had enormous differences
within their borders: Beijing scored high,
for example, whereas Tibet was low. The
United States ranked 29th out of the
195 countries and territories studied.


Weather satellite in peril 


REMOTE SENSING | A balky cooling
system has crippled the most recent
multibillion-dollar U.S. weather satellite,
the Geostationary Operational
Environmental Satellite-17 (GOES-17), which
stares fixedly at the western United States, 
the National Oceanic and Atmospheric 
Administration reported last week. The 
problem began a month ago on the satel- 
lite’s primary instrument, the Advanced 
Baseline Imager (ABI), used for forecasting 
and meteorological research. For half the 
day, while the ABI is exposed directly to the 
sun, the sensor must be cooled to record 
Earth’s emissions in the infrared, used for 
measuring wind speeds, moisture, and temperature.
Launched on 1 March, GOES-17 is
the second of four satellites in an
$11 billion program. Although older satellites
remain operational, the cooling failure
could jeopardize planned gains in forecast- 
ing for the western United States and force 
a re-evaluation of systems, already built, for 
the two satellites set to follow GOES-17. 
Court blocks Nobel Center design


SCIENTIFIC SOCIETIES | A Swedish court last week blocked plans for a controversial new headquarters building for the Nobel Foundation on central Stockholm’s waterfront, saying its design would mar the harbor’s historic character. The eight-story, bronze, glass, and steel Nobel Center would consolidate in one location the foundation’s activities, including the annual December award ceremonies for the world’s most prestigious science prizes, now held in Stockholm’s concert hall. Besides an auditorium, the 1.2 billion Swedish krona ($140 million) center would house the Nobel Museum and space for exhibitions, educational programs, and a restaurant. The design, by the Berlin office of David Chipperfield Architects, has sparked controversy since it was unveiled in 2014. It was scaled back in 2015 and revised again in 2016. But on 22 May the Land and Environment Court said the revised plans would still damage the location’s historic character, lacked an adequate traffic plan, and would block views of the harbor from neighboring buildings. Stockholm city officials said they will appeal the court’s decision to the regional appeals court, which will have the final say.

Critics of the planned Nobel Center (center) oppose its impact on Stockholm’s historic waterfront.


RENEWABLE ENERGY


East Asia becomes a hydropower superpower


Hydropower is the largest source of renewable energy worldwide, and projects in East Asia, particularly China, have driven global growth in gigawatts of hydroelectric generating capacity since 2006. Planned projects in South Asia, especially India, could further boost capacity, according to a report by Oak Ridge National Laboratory in Tennessee for the U.S. Department of Energy. In China, new capacity could come from building large new dams in the nation’s west. In the United States and Europe, where environmental concerns have made the construction of new dams more controversial, some of the planned expansion involves modifying existing dams to produce more power.


Space station’s cool study


PHYSICS | Low temperatures reached new heights when the automated Cold Atom Laboratory (CAL) arrived at the International Space Station on 24 May. The rig will use laser light and radio waves to trap clouds of atoms and chill them to within a billionth of a degree of absolute zero. At those temperatures, a macroscopic quantum wave called a Bose-Einstein condensate (BEC) emerges that can persist longer in the weightless environment than on Earth. CAL scientists will strive for record-low temperatures, look for novel quantum effects, and even blow BEC bubbles. An upgrade could follow in a few years.


A ‘right to try’ unapproved drugs


DRUG DEVELOPMENT | A bill passed by Congress last week would give people with life-threatening illnesses a new means to petition drug companies to give them experimental treatments without the U.S. Food and Drug Administration (FDA) evaluating the request. The “Right to Try” bill, expected to get President Donald Trump’s signature, permits terminally ill patients who have exhausted other treatment options to obtain an unapproved drug, as long as it is undergoing clinical trials and has completed an initial safety study. Patients can already make “compassionate use” requests to FDA; the agency says it grants nearly all of these, but supporters of the new alternative say it will be less cumbersome and time-consuming. Critics, including many patient advocacy groups, have warned that it could put people at risk of harmful drug side effects. The new legislation limits the liability of drug manufacturers and prescribers for providing unproven drugs, but does not oblige companies to do so. FDA Commissioner Scott Gottlieb, who raised concerns about previous versions of the bill, said he stands “ready to implement it in a way that achieves Congress’s intent to promote access and protect patients.”


Still no sign of dark matter 


PHYSICS | The world’s most sensitive dark matter detector, XENON1T, has failed to snare its quarry, researchers reported this week. Every galaxy sits shrouded in invisible dark matter whose gravity keeps it from flying apart, observations indicate. In the 1980s, physicists suggested that dark matter might consist of weakly interacting massive particles (WIMPs) that would interact only through gravity and the weak nuclear force and should linger from the big bang in the right amount to account for dark matter. However, the 165 XENON researchers report that during a year of observations, their latest detector, located in Italy’s subterranean Gran Sasso National Laboratory, saw no evidence of WIMPs hitting atomic nuclei in 1300 kilograms of liquid xenon. The team plans to quadruple the detector’s mass, even as physicists in South Dakota build a similar 7-metric-ton detector called LUX-ZEPLIN. If those efforts come up dry, the next step is uncertain.


THREE QS

Twisting genetics for hate


White supremacists marching in Charlottesville, Virginia, in August 2017.


A year ago, as graduate student Jedidiah Carlson at the University of Michigan in Ann Arbor searched online for a population genetics paper, one result stood out: a link to a forum post on the website Stormfront, one of the internet's most notorious hangouts for white supremacists and neo-Nazis. Curious, he clicked through and found forum users hosting “informal journal clubs,” knowledgeably discussing details of population genetics (and badly mangling others) to find support for a white nationalist ideology. He has followed the evolution of this strange, racist trend on that website and “alt-right” forums, and he chatted with Science about what he’s found. (A longer version of this interview is available at https://scim.ag/QApopgenetics.)


Q: Do people on these forums know what they’re talking about?
A: There's a range of expertise. It behooves them to know the science so they can feel like they are going toe-to-toe with the people actually doing it. There are even some who have graduate degrees. They aren't population geneticists, but some of them are quite well educated, both formally and informally.


Q: How do they misappropriate the work?
A: They grab figures from scientific papers and edit them to make them look like they support the white nationalist ideology. In one example, researchers published a figure with a plot inferring regional ancestry of dozens of different populations based on hundreds of individuals and roughly divided them into clusters. [Some forum users] take this plot and add text like “The genetic reality of race,” with no context showing what the scientists were actually looking at, and ignoring the fact that there's a continuum among the individuals. Then they turn these images into memes and try to make them go viral.


Q: What are they after?
A: They're interested in anything that would reinforce traditional, discrete racial categories. Intelligence is probably the No. 1 topic that they gravitate toward. And anything pertaining to history of human migrations, or things that play into traditional classifications of racial phenotypes like facial morphology or skin color. I don’t think engaging them directly will help, but I think we need to think more carefully about how our work might be misinterpreted.


Taiwan replaces research ship 


MARINE SCIENCE | Taiwan’s marine researchers are celebrating the commissioning last week of Legend, a $30 million replacement for a ship sunk in an October 2014 accident that claimed the lives of two scientists (Science, 24 October 2014, p. 407). The island’s marine research “really suffered” for the past several years, says Wang Chau-Chang, director of the Taiwan Ocean Research Institute in Kaohsiung. The 2600-ton, 76-meter-long Legend is smaller than its predecessor. But it has improved features, including updated sonar equipment for sea bottom profiling, coring capabilities, and the ability to deploy a remotely operated vehicle capable of diving to 3000 meters, all supporting “many different combinations of scientific experiments,” Wang says. After training cruises, Legend will make its maiden scientific voyage in September.


EU ban on consumer plastics 


CONSERVATION | To reduce marine pollu- 
tion, the European Commission last week 
proposed to ban or increase recycling of 
consumer plastic items. Straws, cutlery, and 
plates would be removed from the mar- 
ket; EU member states would be required 
to collect 90% of single-use plastic drink 
bottles by 2025, using, for example, deposit 
refunds. Producers of plastic fishing gear 
would have to collect used material from 
ports and fund public education campaigns 
to encourage proper disposal. Plastics make 
up 85% of marine litter. 


Down under, hordes look up 


ASTRONOMY | Australians last week 
apparently set a Guinness World Record 
for the largest number of people star- 
gazing across multiple venues at once— 
more than 40,000, smashing the previous 
record of 7960, also set by Australians, in 
2015. A final tally of participants was still 
being made when Science went to press. 
The 23 May event featured more than 
285 registered stargazing parties and 
thousands of telescopes deployed across 
the country, the Australian Broadcasting 
Corporation reported. 


SCIENCEMAG.ORG/NEWS
Read more news from Science online.


ENVIRONMENTAL RESEARCH


Scientists aim to smoke out wildfire impacts 


U.S. campaign to study how plumes affect human health and environment 


By Warren Cornwall 


Emily Fischer is likely one of the few people whose summer plans were buoyed by a recent forecast that much of the western United States faces another worse-than-normal wildfire season. Unusually warm weather and drought, together with plenty of dry grass and brush, are expected to create prime conditions for blazes this summer, federal officials announced on 10 May.

The forecast has local officials bracing 
for the worst. But it represents an oppor- 
tunity for Fischer, an atmospheric scientist 
at Colorado State University in Fort Col- 
lins who is preparing to spend the summer 
flying through plumes of wildfire smoke 
aboard a C-130 cargo plane jammed full 
of scientific equipment. The flights are the 
highlight of an unprecedented effort, cost- 
ing more than $30 million, that involves 
aircraft, satellites, instrumented vans, and 
even researchers traveling on foot. Over the
next 2 years, two coordinated campaigns,
one funded by the National Science Foundation
(NSF) and the other by NASA and
the National Oceanic and Atmospheric Administration
(NOAA), aim to better understand
the chemistry and physics of wildfire
smoke, as well as how it affects climate, air
pollution, and human health.

“This is definitely the largest fire experiment that has ever happened,” says atmospheric chemist Carsten Warneke of NOAA’s Earth System Research Laboratory in Boulder, Colorado, one of the lead scientists. Wildfire smoke, he adds, is “one of the largest problems facing air quality and climate issues going forward.”

A smokier future
Forecasts suggest the western United States will see more wildfire smoke waves (events lasting 2 days or more) and longer periods when smoke waves occur.
[Chart: smoke wave days per year and length of smoke wave season, Present (2004-2009) versus Future (2046-2051)]

The problem is growing as the size and 
intensity of wildfires rise in the western 
United States, marinating communities in 
smoke. Wildfires account for more than 
two-thirds of the particulate matter in the 
West on days that exceed federal clean air 
standards, according to a 2016 study in 
the journal Climatic Change. And global 
warming is likely to stoke even more fire in 
coming years, by making wildlands more 
combustible. By midcentury, more than
80 million people living across much of the
West can expect a 57% increase in the number
of “smoke waves” (events that shroud a
community for 2 days or more), according to
the 2016 study. The consequences for public
health could be sobering; smoke includes an 
array of noxious compounds and tiny parti- 
cles that can complicate breathing and pro- 
mote disease. Other parts of the Americas as 
well as Europe, Africa, Asia, and Australia 
are likely to experience the same climate- 
driven surge in wildfires, according to U.S. 
Forest Service researchers. 

Despite the potential threat, wildfire smoke has received little sustained scientific attention. The two new campaigns aim to change that. This year, the NSF-funded team that includes Fischer aims to fly its instrumented C-130 through 15 to 20 wildfire plumes. And next year, researchers with NASA and NOAA will have access to a bigger aircraft, a DC-8 jet, that will scour smoky skies across the United States.

Smoke from a nearby wildfire blanketed Seeley Lake, Montana, for weeks in the summer of 2017.

One goal is to inventory the chemicals 
released by wildfires, including nitrogen 
oxides and carbon monoxide, and a vast 
array of volatile organic compounds. Cur- 
rent models for predicting the chemical 
makeup of smoke, which rely largely on 
satellite observations, have a huge margin 
for error, Warneke says. In part, that’s be- 
cause of uncertainty about how much veg- 
etation wildfires consume. New studies that 
combine data from satellites, aircraft, and 
ground-based researchers scrutinizing burn 
sites should help fine-tune those estimates. 

Researchers will also use the C-130 to chase plumes during their first 24 hours aloft, to see how the chemistry of smoke changes as it wafts through the atmosphere. In particular, they hope to get a more precise picture of what happens to the nitrogen released by burned vegetation, including how much is converted into nitrogen oxides that can contribute to ground-level ozone, a regulated pollutant that can worsen breathing problems. Such data, Fischer says, could help answer questions like: “If wildfires are burning in the Pacific Northwest, what does that mean for Colorado air quality?”

“If wildfires are burning in the Pacific Northwest, what does that mean for Colorado ... ?”
Emily Fischer, Colorado State University

Researchers also plan to track the evolution of aerosols, minute particles that can either scatter or absorb sunlight and play a major role in shaping climate. And they want to observe plumes as they collide with clouds, to better understand how the two affect each other. Smoke particles can alter cloud formation by acting as nuclei for ice particles, potentially influencing the weather, and cloud moisture can alter smoke chemistry. The results could help improve weather forecasts.

At night, falling temperatures can cause smoke plumes to sink into valleys, worsening air quality there. NOAA and NASA researchers will track the plumes with aircraft, vans, and a drone. That initiative will also involve DC-8 flights beyond the West, into the Midwest and Southeast, tracking smoke from fires intentionally set to clear farm fields and prescribed burns in forests. The goal of collecting such a wide array of data, Warneke says, is “to do the whole picture at one time and understand how the whole thing plays together.”

Beyond these projects, public health researchers are taking a growing interest in what happens when smoke blankets communities, sometimes for weeks at a time. Past studies have found that short-term smoke exposure can increase problems for people with asthma and other lung ailments, but “there’s really not much information at all” about the effects of long-term, chronic exposure, says Curtis Noonan, an environmental epidemiologist at the University of Montana in Missoula.

Noonan was at the center of some of the worst smoke of the 2017 wildfire season, when Montana was hit by fires that burned 400,000 hectares. The biggest blow fell on Seeley Lake, a town of 1600 located 50 kilometers northeast of Missoula. The nearby Rice Ridge fire filled the town with smoke for much of August and early September 2017, driving levels of fine particulate matter to nearly 20 times the acceptable limit set by the Environmental Protection Agency.

Noonan is now working with colleagues to gather health information from residents of Seeley Lake and several other Montana towns. They aim to track how respiratory performance, mental states, and genetic markers related to inflammation change when smoke descends. Noonan is also seeking funding to sift through health records of wildland firefighters to understand how sustained smoke exposure has affected them.

Farther north, Sarah Henderson, an environmental epidemiologist and veteran smoke researcher at the British Columbia Centre for Disease Control in Vancouver, Canada, hopes to track the fate of children born during high-smoke events. One big question, she says, is: “If you’re born into really smoky conditions with your extremely sensitive, newborn lungs, what does that mean for you?”

As scientists prepare to tackle such questions, health officials in Missoula are preparing for a possible repeat of last year’s smoke waves. The health department is stockpiling indoor air filters for day care centers, schools, and other gathering spots.

Fischer, for one, hopes they aren’t needed. Although she requires fire for her studies, she says, “I’m just wishing for an average wildfire year with wildfires in wilderness areas that don’t cause any property damage.”


Will U.S. academies expel sexual harassers?


Prestigious bodies consider changing bylaws to eject members who offend


By Meredith Wadman 


As high-profile sexual harassment cases fuel public criticism, the presidents of the U.S. National Academies of Sciences, Engineering, and Medicine announced last week they may adopt new policies allowing the prestigious bodies to eject members who have committed harassment and other forms of misconduct. Members of the academies, which serve as both honorific societies and advisers to the U.S. government, are elected by existing members to life-long terms, and the bodies currently lack mechanisms for removing them for harassment.

Because scientists and the public “place 
much trust” in the three Washington, 
D.C.-based academies, their leadership 
councils “have begun a dialogue about 
the standards of professional conduct 
for membership,” the presidents said in a 
22 May statement. “We want to be sure 
that we are doing everything possible to 
prevent sexual harassment, to instill a 
culture of inclusion and respect, and to 
reinforce that harassment is not toler- 
ated.” The statement was signed by Marcia 
McNutt, head of the National Academy of Sciences
(NAS); C. D. Mote Jr., head of the National
Academy of Engineering (NAE); and
Victor Dzau, head of the National Academy 
of Medicine (NAM). 

Some researchers welcomed the an- 
nouncement. “This may seem small, but 
as someone who’s been working with them 
for 2 years, this is BIG for this organiza- 
tion,” tweeted Kate Clancy, an anthropo- 
logist who studies sexual harassment 
in science at the University of Illinois in 
Urbana. Clancy helped author an NAS re- 
port on sexual harassment in science that 
will be released on 12 June. 

But Clancy and others also expressed concern that the “dialogue” would not lead to major changes, and criticized the academies for moving too slowly. “McNutt and other science society presidents have a moral imperative to get these knuckleheads out. Folks need to know women in science are done waiting for our societies to have our backs,” says neuroscientist BethAnn McLaughlin of the Vanderbilt University School of Medicine in Nashville. On 6 May, angered by an article in Science detailing allegations of sexual harassment by scientist and NAS member Inder Verma of the Salk Institute for Biological Studies in San Diego, California (Science, 4 May, p. 480), McLaughlin launched an online petition asking NAS to eject members sanctioned for sexual harassment. (Verma has denied the allegations; Salk is investigating.)

McNutt argues that getting it right will take time at the staid and governance-laden academy she runs. NAS’s 17-member governing council, which includes 11 women, will discuss the issue at its next meeting, McNutt tweeted, with an eye toward having the full NAS membership vote on amending its bylaws at its annual meeting next April. The outcome of that vote is not certain, McNutt suggested in a tweet. “Anyone who thinks it is easy [to change bylaws] has not tried to get a majority vote from an honorary society of more than 80 percent men over 70 years average age.” (The average age of the 2382 NAS members is 72; 84% are men.)

Marcia McNutt, head of the U.S. National Academy of Sciences, wants members to consider ejecting harassers.

NAM and NAE referred questions about how those two bodies will proceed to NAS, which said it is too early to know.

The effort comes as U.S. universities, scientific societies, and research funders are scrambling to respond to a wave of allegations of sexual harassment and scientific misconduct. One society, the American Geophysical Union, has expanded its definition of research misconduct to include sexual harassment and rescinded an honor bestowed on a scientist facing allegations of misbehavior. Universities have suspended or fired researchers under investigation for harassment, or found to be harassers.

Until last week, however, the academies had remained mostly silent on how they might address misconduct among their 7000 members, despite mounting criticism. As of 29 May, more than 2700 people had signed the petition launched by McLaughlin, which asked NAS to kick out members “who have been sanctioned for sexual harassment, retaliation and assault.”

The conundrum presented by lifetime memberships in the academies has been apparent for some time. NAM member Arthur Kellermann, dean of the Uniformed Services University of the Health Sciences (USUHS) in Bethesda, Maryland, last year filed a complaint urging his academy to eject physician Eric Noji. A 2016 investigation by the university found that Noji, who had been an adjunct professor at USUHS, had plagiarized several research papers and misrepresented his credentials prior to his election to NAM, according to The New York Times. The university fired Noji, but he remains a member of NAM. In late 2016, the case prompted NAM to adopt a new rule allowing it to “rescind” a membership if “it is later determined by an authorized body that the information relied on for election is false.”

Other current academies members could 
be vulnerable to expulsion under new by- 
laws covering harassment. In 2015, as- 
tronomer Geoffrey Marcy resigned from 
the University of California, Berkeley, after 
BuzzFeed published the results of a university 
investigation that concluded he had violated 
its policies on sexual harassment. In 
March, Thomas Jessell, a Columbia Univer- 
sity neuroscientist, was fired by the school 
and the Howard Hughes Medical Institute 
after a university investigation concluded 
he had violated Columbia’s policies on 
consensual sexual relationships between 
faculty and students, according to the Co- 
lumbia Daily Spectator. 

Exactly how the academies would carry 
out an expulsion and what evidence they 
might require before acting are just two 
questions facing their leadership coun- 
cils. Some argue that findings of miscon- 
duct by an institution should be enough 
to hit the eject button. But NAS member 
Robert Weinberg, a cancer scientist at the 
Massachusetts Institute of Technology in 
Cambridge, disagrees. “I have little faith 
that findings of sexual harassment by the 
host institutions of accused parties will be 
adjudicated in a fashion that applies uni- 
form standards of evidence and fairness,” 
Weinberg wrote in an email to Science. 

“It’s important to make sure there was 
due process. ... Allegations should not be 
sufficient to eject a member,” says Robert 
Cook-Deegan, a longtime observer of the 
academies and a science policy specialist 
with Arizona State University who is based 
in Washington, D.C. He adds: “The acad- 
emies would do well not to get into the polic- 
ing and prosecution, but rather identify the 
findings of courts or systematic and credible 
administrative processes.” 

Unlike the U.S. academies, several foreign 
honorary societies do have procedures for 
ejecting members for misconduct. But the 
German academy (called the Leopoldina) 
and the Royal Netherlands Academy of Arts 
and Sciences have not in living memory 
ejected members, according to their spokes- 
people. At the Royal Society in the United 
Kingdom, “the list of expulsions is largely 
historical and the top reason cited is non- 
payment of subscriptions,” a spokesperson 
wrote in an email. 

AAAS (publisher of Science) in Washing- 
ton, D.C., also has no mechanism for eject- 
ing fellows, who are elected for life. But 
spokesperson Tiffany Lohwater says AAAS 
leaders are drafting a policy for revoking 
the status of fellows who don’t live up to “the commonly 
held standards of professional ethics and 
scientific integrity.” 
HUMAN EVOLUTION 


New copies of old gene drove brain expansion 


Research shows how genes supercharge neuron formation during human development 


By Elizabeth Pennisi 


Three nearly identical genes could 
help explain how 0.5 liters of gray 
matter in early human ancestors 
became the 1.4-liter organ that has 
made our species so successful and 
distinctive. The newly identified 
genes could also help explain how brain 
development sometimes goes wrong, lead- 
ing to neurological disorders. 

The genes, descendants of an ancient 
developmental gene that multiplied and 
changed in the course of evolution, add 
to a growing list of DNA implicated in 
human brain expansion (Science, 3 July 
2015, p. 21). But they stand out because 
so much has been learned about how they 
work their magic, says James Noonan, an 
evolutionary genomicist at Yale Univer- 
sity. Researchers have shown that this trio 
boosts the number of potential nerve cells 
in brain tissue, and one team even pinned 
down the protein interactions likely re- 
sponsible. “These are new proteins that 
are potentially modifying a very important 
pathway in brain development in a very 
powerful way,” Noonan adds. 

Until now, the four genes were thought 
to be one, NOTCH2NL, itself a spinoff of 
the NOTCH gene family, which controls 
the timing of development in everything 
from fruit flies to whales. But two studies 
in the 31 May issue of Cell trace a series 
of genetic accidents in recent evolutionary 
history that have yielded four very closely 
related NOTCH2NL genes in humans (see 
graphic, right). 

David Haussler, a bioinformatician at the 
University of California, Santa Cruz, and his 
colleagues got on the trail of the genes after 
they discovered that the NOTCH pathway 
works differently in human and macaque 
brain organoids—test tube models of the 
developing brain. NOTCH2NL was missing 
in the macaque organoid and, later analyses 
showed, in other nonhuman apes as well. 
That suggested NOTCH2NL might have 
played a unique role in human evolution. 

By comparing NOTCH2NL-related DNA 
in the genomes of humans and other pri- 
mates, Haussler’s team reconstructed the 
genes’ evolutionary history. They concluded 
that during DNA replication perhaps 14 mil- 
lion years ago, part of an ancestral NOTCH2 
gene was copied by mistake. The new 
“gene” was incomplete and nonfunctional, 
but about 11 million years later—shortly 
before human ancestors’ brains began to 
expand—an additional piece of NOTCH2 
got inserted into this copy, making the gene 
functional. “This event marks the birth of 
the NOTCH2NL genes we now have in our 
brains,” says Frank Jacobs, a co-senior au- 
thor on the paper and an evolutionary gen- 
omicist at the University of Amsterdam. 
Subsequently, that active NOTCH2NL 
gene was duplicated twice more, yielding 
three active NOTCH2NL genes in a row at 
one end of human chromosome 1 and one 
inactive copy on the other end. Gene copies 


A boost for the brain 


In an ape ancestor, duplication of NOTCH2 yielded 
a nonfunctional NOTCH2NL (gray). Repair and later 
gene duplications produced multiple working copies. 
can be potent evolutionary forces because 
one copy continues its necessary job, leav- 
ing the others free to do something new. 

Pierre Vanderhaeghen, a developmen- 
tal neurobiologist at the Free University of 
Brussels, uncovered the same set of genes 
when he found a way to screen human fe- 
tal brain tissue for duplicated genes. To 
find out what they do, his team ramped up 
NOTCH2NL activity in cultured brain tis- 
sue. The tissue made more stem cells, they 
report in the second Cell paper. 

The finding complements one reported 
earlier this spring by Wieland Huttner, a 
neurobiologist at the Max Planck Institute 
of Molecular Cell Biology and Genetics in 
Dresden, Germany. He and his team had de- 
cided to focus on NOTCH2NL (which they 
thought was a single gene) after finding it 
was highly active in fetal brain cells. When 
they put a human NOTCH2NL gene into 
incipient brain tissue from mice embryos, 
more stem cells developed. That suggests 
the human gene delays the specialization 
of those cells so they have a chance to pro- 
duce many more copies of themselves, the 
researchers reported in eLife on 21 March. 

Now, in their Cell paper, Vanderhaeghen 
and his colleagues describe molecular de- 
tails of how NOTCH2NL works to boost 
neuron formation. They found that a 
NOTCH2NL protein blocks a key step in a 
signaling pathway that causes stem cells 
to differentiate and stop dividing. As a 
result, the cells persist and keep produc- 
ing progeny, ultimately yielding a larger 
crop of neurons. “That’s really compelling 
biological data,” Noonan says. “In 
other studies of genes involved in human 
evolution, it’s been very difficult to draw 
a line from the genetic difference to the 
phenotype to a biochemical mechanism 
that’s responsible.” 

The location of the three active NOTCH2NL 
genes is also telling, Haussler says. They are 
smack in the middle of DNA implicated in 
autism, schizophrenia, and a developmen- 
tal delay syndrome. Such duplicated DNA is 
prone to getting copied extra times or losing 
DNA during replication, and instability is a 
hallmark of these disorders. To Greg Wray, 
an evolutionary developmental biologist at 
Duke University in Durham, North Carolina, 
this clue to brain diseases is the most com- 
pelling new result. “These genes likely play 
an important role in cortical development, 
and misregulation leads to disease,” he says. 

Wray is less convinced that the genes had 
a unique role in human evolution because 
the chromosomal region in which they re- 
side is complex and difficult to sequence, 
and because the evidence for an evolution- 
ary difference in gene function between hu- 
mans and other species is indirect. 

But Haussler thinks these genes will 
prove key players in human brain expan- 
sion. “One change didn’t do it alone, but 
some will be found to be more fundamental 
than others,” he points out. “NOTCH2NL 
has a shot at this.” 
RESEARCH FUNDING 


Europe's science spending set 
for another big boost 


Commission seeks €97.6 billion for “Horizon Europe” 


By Tania Rabesandratana 


On 7 June, the European Commission 
will lay out detailed plans for one of 
the biggest single research programs 
on the planet. Called Horizon Europe, 
the program could be worth €97.6 billion 
between 2021 and 2027, up from 
about €77 billion for the current 7-year pro- 
gram, Horizon 2020. Its influence, however, 
will go beyond size. 

Europe’s research programs provide stable 
funding for 7 years, some of it up for grabs for 
researchers around the world. And although 
they represent less than 10% of the total re- 
search money available in the European 
Union, the continuous growth of the EU sci- 
ence budget in the past decades (see graph, 
right), at the expense of agriculture and re- 
gional development, is a clear signal that it 
sees research and innovation as the future 
drivers of its economy. 

Next week’s proposals are unlikely to con- 
tain major surprises, because the commis- 
sion has unveiled its main ideas over the past 
months, in particular its overall 7-year bud- 
get plan, issued on 2 May. Although Horizon 
Europe will keep Horizon 2020’s main fea- 
tures, the commission has laid the ground- 
work for several novelties, including a new 
agency to tackle the continent’s perennial 
innovation problem and a big, separate push 
on collaborative defense research. But con- 
tentious negotiations lie ahead. The United 
Kingdom is negotiating the terms of its impending 
exit from the European Union, and 
some member states want to tighten bud- 
gets. Meanwhile, research advocates want 
more generous spending, noting the low ap- 
plication success rates in Horizon 2020—a 
frustrating 11.9% so far. 

Like previous programs, Horizon Europe 
will have three main “pillars”; next week’s 
plan will detail how much money could go 
to each. The first component, called Open 
Science, will provide funding for projects 
“driven by researchers themselves,” as the 
commission puts it, through the well-liked 
basic research grants of the European Re- 
search Council (ERC) in Brussels and the 


Science on the rise 
The company that developed HELIOtube, an 
inflatable solar heat collector, received funding under 
Horizon 2020. The European Commission plans to 
give innovation a bigger boost in Horizon Europe. 


Marie Skłodowska-Curie fellowships for 
doctoral programs, postdocs, and staff ex- 
changes. This part of the program is largely 
unchanged from Horizon 2020. 

The second pillar, Global Challenges, 
will set so-called missions addressing is- 
sues “that worry us daily such as the fight 
against cancer, clean mobility, and plastic-free 
oceans,” says a commission fact sheet. 
The “missions” are meant to be flexible, as 
priorities change, and they appear to have a 
sharper focus than the “societal challenges” 
named in a comparable pillar of Horizon 
2020, including energy and food security. 

The third part of Horizon Europe, called 
Open Innovation, addresses an old problem: 
Europe’s shortage of successful innovative 
businesses, despite its world-class science. 
At the moment, EU science funding for busi- 
nesses largely goes through sizable public- 
private partnerships involving big firms, 
for example in the fields of aeronautics and 
pharmaceuticals. Now, EU research com- 
missioner Carlos Moedas is launching a new 
project, the European Innovation Council 
(EIC), to encourage startup companies and 
“breakthrough technologies.” 

The commission says EIC will differ from 
the European Institute of Innovation and 
Technology (EIT) in Budapest, set up in 
2008. EIT—the pet project of former commis- 
sion President José Manuel Barroso—brings 
together businesses, research centers, and 
universities in six pan-European “Innovation 
Communities.” Some observers say EIC’s cre- 
ation signals that EIT didn’t quite deliver and 
is being marginalized. 

EIC will use the ingredients that made 
ERC successful: focusing on individual en- 
trepreneurs rather than big cross-border 
teams, letting ideas emerge from the bot- 
tom up, and keeping grants and procedures 
simple. Its success “will depend on putting 
the right evaluation system into place,” says 
Austrian sociologist and former ERC Presi- 
dent Helga Nowotny. “It takes excellence to 
recognize excellence.” But many universities 
are upset that the current pilot program for 
EIC, worth €2.7 billion for 3 years, didn’t 
include them in its group of advisers. “Next 
to CEOs and entrepreneurs, there is also 
room for researchers,” says Kurt Deketelaere, 
secretary-general of the League of European 
Research Universities in Leuven, Belgium. 
He adds that there are more pressing barri- 
ers to innovation than a lack of funding, not- 
ing that the European Union’s 28 member 
states “have 28 different schemes for taxa- 
tion, intellectual property, bankruptcy.” 
In addition to Horizon Europe, the commission 
has proposed another bold move 
for research: setting aside €4.1 billion over 
7 years as a separate budget line for defense 
research, up from just €90 million under an 
ongoing 3-year pilot program. Member states 
have long been lukewarm about cooperation 
in this secretive area, where national interests 
prevail. But in times of growing “geopolitical 
instability,” as the commission puts it, 
some member states seem more willing to 
pool resources. 

Yet some 700 scientists have signed a pe- 
tition against any EU funding of military 
research; others worry the plan could come 
at the expense of nonmilitary science. “We 
will oppose anything that could take fund- 
ing away from Horizon Europe’s civilian re- 
search,” says Maud Evrard, head of policy 
at Science Europe in Brussels, a group of 
national science funding agencies and re- 
search organizations. 

The commission’s €97.6 billion opening 
bid represents a 27% increase from the previ- 
ous 7-year period—or even a 46% rise if com- 
pared to Horizon 2020 without the share of 
the United Kingdom, which is leaving the Eu- 
ropean Union in March 2019. But with some 
member states keen to tighten the European 
Union’s purse strings, Horizon Europe's bud- 
get is likely to go down in coming negotia- 
tions with the European Parliament and EU 
member states. As a result, both Evrard and 
Deketelaere say they are disappointed that 
the commission didn’t aim higher. 
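
The percentages quoted above can be checked with a few lines of arithmetic. The sketch below uses only the two budget figures given in the article; the "implied baseline" is a back-calculation from the quoted 46% rise, not a number the commission has published, and the function and variable names are illustrative.

```python
def percent_increase(new, old):
    """Return the rise from old to new, in percent."""
    return (new / old - 1.0) * 100.0


# Figures quoted in the article, in billions of euros.
HORIZON_EUROPE = 97.6
HORIZON_2020 = 77.0  # the article says "about €77 billion"

# Rise relative to the full Horizon 2020 budget.
rise = percent_increase(HORIZON_EUROPE, HORIZON_2020)

# Assumption: back out the Horizon 2020 budget without the U.K. share
# from the quoted 46% rise. This is inferred, not stated in the article.
implied_baseline = HORIZON_EUROPE / 1.46

print(f"Rise over Horizon 2020: {rise:.1f}%")
print(f"Implied baseline without the U.K.: {implied_baseline:.1f} billion euros")
```

The first figure rounds to the 27% cited; the second suggests a U.K.-free baseline of roughly €67 billion, subject to the rounding in "about €77 billion."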

Negotiations for such programs can easily 
stretch to at least 18 months, but the com- 
mission wants to make as much progress as 
possible before elections to renew the Euro- 
pean Parliament—which usually is very sup- 
portive of research—in May 2019. That will 
give the United Kingdom a chance to help 
shape the 7-year plan before it loses its seats 
in Parliament and the European Council. 
“We need to make the most of these channels 
whilst we can,” Jessica Cole, head of policy at 
the Russell Group, a London-based group of 
24 leading U.K. universities, wrote in a blog 
post on 4 May. 

The United Kingdom has made clear 
that it wants to keep taking part in EU 
research programs after it leaves the bloc. 
This will require buying its way in through 
a bilateral association agreement, as other, 
smaller, non-EU countries such as Norway 
and Israel do. Other non-EU countries 
will be following the negotiations closely. 
Under Moedas’s mantra of “Open Science, 
Open Innovation, Open to the World,” the 
commission is likely to lift restrictions and 
make it easier for countries outside Europe 
and its immediate neighborhood to buy a 
stake in the research flagship—a sign that 
Europe’s horizons are widening further. 
AIR POLLUTION 


United Kingdom unveils 
ambitious air pollution plan 


Bid to slash particulate pollution draws praise and doubts 


By Matthew Warren 


The United Kingdom wants to become 
a global leader in cleaning up polluted 
air—but the island nation may not be 
able to meet some of its ambitious 
new goals by itself. Last week, the U.K. 
government released a draft strategy 
for reducing air pollution that would impose 
the industrialized world’s strictest emission 
targets for tiny particles of soot and other 
compounds that can lodge in people’s lungs 
and shorten their lives. The plan would also 
tackle a source of particulate smog that is es- 
pecially hard to rein in: ammonia from farm 
fields and manure piles. 

Air quality researchers are applauding 
the goals of the 104-page Clean Air Strat- 
egy, which Prime Minister 
Theresa May’s government 
released on 22 May for pub- 
lic comment. But achieving 
those goals will be a chal- 
lenge, they caution, in part 
because much of the United 
Kingdom’s particulate pollu- 
tion drifts in from other na- 
tions. Even at home, cleaning 
up multiple, widespread pollution sources 
will require money and political will. 

The boldest goal is to halve, by 2025, the 
number of people breathing air with con- 
centrations of fine particulates—particles 
less than 2.5 microns wide—that exceed 
levels set by the World Health Organiza- 
tion (WHO). WHO’s particulate standard of 
10 micrograms per cubic meter (µg/m³), averaged 
over a year, is much tighter than the 
European Union’s limit, which is set to drop 
to 20 µg/m³ in 2020. (The U.S. standard is 
12 µg/m³.) 

That commitment is welcome, says 
Alastair Lewis, an atmospheric chemist at 
the University of York in the United King- 
dom and deputy director of the National 
Centre for Atmospheric Science in Leeds, 
U.K. Although it stops short of requiring 
the United Kingdom to meet the stringent 
WHO limit in every community in the na- 
tion, it will likely “become the de facto 
standard that people will hold [the govern- 
ment] to. You can’t put the genie back in 
the bottle and pretend you never said it.” 


Published by AAAS 


“They’re fighting 
on an awful lot of 
fronts here ...” 

Alastair Lewis, 
University of York 


To achieve the goal, the government pro- 
poses tackling emissions produced by do- 
mestic stoves and fireplaces, which account 
for 38% of the United Kingdom’s particulate 
pollution. New rules could limit the use of 
polluting fuels such as wet wood, for exam- 
ple, and raise emission standards for new 
stoves. The plan also envisions phasing out 
diesel-fueled trains by 2040 and tightening 
standards on vehicle tires and brakes, which 
throw off tiny particles during use. (The plan 
does not address other vehicle emissions, 
which a second document will cover.) 

Another key strategy is to reduce emissions 
of ammonia, which fuels chemical reactions 
in the atmosphere that produce an array of 
problematic particulates. Farm use of fertil- 
izers and manure from livestock and poultry 
produce about 88% of the 
United Kingdom’s ammo- 
nia emissions, so proposed 
plans would curb fertilizer 
use and require covering 
manure heaps to trap am- 
monia gas. Such steps could 
mark “a key turning point” for 
controlling ammonia 
emissions, says environmental 
physicist Mark Sutton of the Centre 
for Ecology & Hydrology in Edinburgh. If 
current trends continue, Sutton notes, the 
United Kingdom will fall short of meeting 
ammonia goals set for 2020. 

But enforcing ammonia limits on thou- 
sands of farms will be complicated and 
could meet with resistance from agriculture 
interests. And although the government says 
it is demonstrating a commitment to go “fur- 
ther and faster than the EU” in combatting 
particulates, the United Kingdom will likely 
need help from other European nations to 
realize cuts, given that up to one-third of the 
nation’s smog—especially in the southeast- 
ern part of the country—drifts in from the 
continent in the form of precursor gases. 

The sweeping plan isn’t scheduled to be 
finalized until early 2019, and many of the 
details have yet to be hashed out. “They’re 
fighting on an awful lot of fronts here ... a 
huge, wide range of sectors and industries,” 
Lewis says. “The question is: Have they re- 
ally set out a case that they will have the 
resources and the capacity to do that?” 
By Adrian Cho 


Philip Hopkins, a theoretical astrophysicist 
at the California Institute 
of Technology in Pasadena, 
likes to prank his colleagues. 
An expert in simulating the formation 
of galaxies, Hopkins 
sometimes begins his talks by 
projecting images of his creations 
next to photos of real galaxies and 
defying his audience to tell them apart. 


“We can even trick astronomers,” says 
Hopkins, a leader of FIRE, the Feedback 
in Realistic Environments simulation. “Of 
course, it’s not a guarantee that the mod- 
els are accurate, but it’s sort of a gut check 
that you're on the right track.” 

For decades, scientists have tried to sim- 
ulate how the trillions of galaxies in the 
observable universe arose from clouds of 
gas after the big bang. But in the past few 
Galaxy simulations are finally producing 
realistic results—and surprising insights 
into the evolution of the universe 


years, thanks to faster computers and bet- 
ter algorithms, the simulations have begun 
to produce results that accurately capture 
both the details of individual galaxies and 
their overall distribution of masses and 
shapes. “The whole thing has reached this 
little golden age where progress is coming 
faster and faster,” says Tiziana Di Matteo, 
a numerical cosmologist at Carnegie Mel- 
lon University in Pittsburgh, Pennsylvania, 
and a leader of the BlueTides simulation. 
As the fake universes improve, their role 
also is changing. For decades, information 
flowed one way: from the astronomers 
studying real galaxies to the modelers try- 
ing to simulate them. Now, insight is flow- 
ing the other way, too, with the models 
helping guide astronomers, says Stephen 
Wilkins, an extragalactic astronomer at 
the University of Sussex in Brighton, U.K., 
who works on BlueTides. “In the past the 
simulations were always trying to keep up 
with the observations,” says Wilkins, who 
is using BlueTides to predict what NASA’s 
James Webb Space Telescope will see when 
it launches in 2020 and peers deep into 
space and far back in time. “Now we can 
predict things that we haven’t observed.” 

For example, the models suggest that the 
earliest galaxies were oddly pickle-shaped, 
that wafer-thin spiral galaxies are surpris- 
ingly rugged in the face of collisions, and 
that to explain the evolution of the uni- 
verse, galaxies must form stars far more 
slowly than astrophysicists expected. 

The simulations also sound a cautionary 
note. Some cosmologists hope galaxy for- 
mation will ultimately turn out to be a rel- 
atively simple process, governed by a few 
basic rules. However, modelers say their 
faux universes suggest that, like maturing 
teenagers, galaxies are unpredictable. It’s 
hard, for example, to tell why one turns 
into a graceful spiral but another evolves 
into a blob. “It’s clear from everything that 
we've done that the physics of galaxy for- 
mation is incredibly messy,” Wilkins says. 


BEFORE YOU CAN cook up a universe, you 
need to know the ingredients. From various 
measurements, cosmologists have deduced 
that just 5% of the mass and energy of the 
cosmos is ordinary matter like that in stars 
and planets. Another 26% consists of mys- 
terious dark matter that, so far, appears to 
interact only through gravity—and presum- 
ably consists of some undiscovered particle. 
The remaining 69% is a form of energy that 
stretches space and is speeding up the ex- 
pansion of the universe. That “dark energy” 
A view of the present-day cosmic web 300 million 
light-years across, as modeled by IllustrisTNG. 
Galaxies (gold) have blown off shocked gas (white). 


may be a property of the vacuum of space 
itself, so physicists call it the cosmological 
constant, denoted lambda (Λ). 

Cosmologists also know the recipe’s ba- 
sic steps. The universe sprang into exis- 
tence in the big bang as a hot, dense soup 
of subatomic particles. Within a sliver 
of a second, it underwent an exponen- 
tial growth spurt called inflation, which 
stretched infinitesimal quantum fluctua- 
tions in the particle soup into gargantuan 
ripples. Slowly, dense regions of dark 
matter coalesced under their own gravity 
into a vast tangle of clumps and filaments 
known as the cosmic web. Attracted by the 
dark matter’s gravity, gas settled into the 
clumps, also called haloes, and condensed 
into the fusing balls of hydrogen called 
stars. By 500 million years after the big 
bang, the first galaxies had formed. Over 
the next 13 billion years, they would drift 
on cosmic gravitational tides and grow by 
merging with one another. 

Computer simulations helped develop 
that theory. In the 1980s they showed that 
to form clumps large enough to bind the 
observed clusters of galaxies, dark matter 
particles had to be slow moving and cold. 
The basic theory, which assumes a cosmological 
constant, became known as Λ cold 
dark matter (ΛCDM). As the theory grew 
more refined, so did the simulations. By 
2005 the Millennium simulation, led by 
researchers at the Max Planck Institute 
for Astrophysics in Garching, Germany, 
produced a rendering of the cosmic web 
whose structure closely matched how the 
galaxies are strewn through space in clus- 
ters, threads, and sheets. 

Millennium and similar simulations suf- 
fered from a fundamental shortcoming, 
however. They modeled the gravitational 
interactions of dark matter alone, which 
are easy to simulate because, as far as scien- 
tists know, dark matter flows through itself 
without friction or resistance. Only once the 
haloes formed did the programs insert gal- 
axies of various sizes and shapes, following 
certain ad hoc rules. In such simulations, 
“The fundamental assumption is that the 
galaxies occupy the haloes and don’t do 
anything to them,” says Yu Feng, a cosmo- 
logist at the University of California (UC), 
Berkeley. “The interaction is all one way.” 

Now, modelers include the interactions of 
ordinary matter with itself and with dark 
matter—processes that are far harder to 
capture. Unlike dark matter, ordinary mat- 
ter heats up when squeezed, generating 
light and other electromagnetic radiation 
that then pushes the matter around. That 
complex feedback reaches an extreme when 
gas clouds collapse into glowing stars, stars 
blow up in supernova explosions, and black 
holes swallow gas and spew radiation. Criti- 
cal to the behavior of galaxies, such physics 
must be modeled by using the equations of 
hydrodynamics, which are notoriously dif- 
ficult to solve, even with supercomputers. 

In general, modelers attack the problem 
by breaking it into billions of bits, either 
by dividing space into a 3D grid of sub- 
volumes or by parceling the mass of dark 
and ordinary matter into swarms of par- 
ticles. The simulation then tracks the inter- 
actions among those elements while ticking 
through cosmic time in, say, million-year 
steps. The computations strain even the 
most powerful supercomputers. BlueTides, 
for example, runs on Blue Waters—a super- 
computer at the University of Illinois in 
Urbana that can perform 13 quadrillion 
calculations per second. Merely loading 
the model consumes 90% of the computer’s 
available memory, Feng says. 
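
The particle approach described above can be illustrated with a toy example. The sketch below is not the method used by BlueTides or any other simulation named here: it follows gravity only, sums forces over every pair of particles (production codes rely on far more efficient schemes and add hydrodynamics), and uses arbitrary units. The constants `G` and `SOFTENING` and all function names are assumptions made for illustration.

```python
import math

G = 1.0           # gravitational constant in arbitrary code units (assumption)
SOFTENING = 0.05  # hypothetical smoothing length; avoids huge forces at tiny separations


def accelerations(positions, masses):
    """Gravitational acceleration on every particle, summed over all pairs."""
    n = len(positions)
    acc = [[0.0, 0.0, 0.0] for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            d = [positions[j][k] - positions[i][k] for k in range(3)]
            r2 = sum(c * c for c in d) + SOFTENING ** 2
            inv_r3 = r2 ** -1.5
            for k in range(3):
                acc[i][k] += G * masses[j] * d[k] * inv_r3
    return acc


def leapfrog_step(positions, velocities, masses, dt):
    """Advance all particles in place by one time step (kick, drift, kick)."""
    acc = accelerations(positions, masses)
    for i in range(len(positions)):
        for k in range(3):
            velocities[i][k] += 0.5 * dt * acc[i][k]  # half kick
            positions[i][k] += dt * velocities[i][k]  # full drift
    acc = accelerations(positions, masses)
    for i in range(len(positions)):
        for k in range(3):
            velocities[i][k] += 0.5 * dt * acc[i][k]  # second half kick


# Two equal-mass particles starting at rest, one unit apart.
positions = [[-0.5, 0.0, 0.0], [0.5, 0.0, 0.0]]
velocities = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
masses = [1.0, 1.0]

for _ in range(10):
    leapfrog_step(positions, velocities, masses, dt=0.01)

print("separation after 10 steps:", math.dist(positions[0], positions[1]))
```

Because every pair is visited, the cost grows with the square of the particle count, which is one reason the real calculations, with billions of elements, demand supercomputers.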

For years such simulations produced gal- 
axies that were too gassy, massive, and blobby. 
But computer power has increased, and, 
more important, models of the radiation- 
matter feedback have improved. Now, hydro- 
dynamic simulations have begun to pro- 
duce the right number of galaxies of the 
right masses and shapes—spiral disks, 
squat ellipticals, spherical dwarfs, and 
oddball irregulars—says Volker Springel, 
a cosmologist at the Heidelberg Institute 
for Theoretical Studies in Germany who 
worked on Millennium and leads the Illus- 
tris simulation. “Until recently, the simula- 
tion field struggled to make spiral galaxies,” 
he says. “It’s only in the last 5 years that 
we've shown that you can make them.” 

The models now show that, like people, 
galaxies tend to go through distinct life 
stages, Hopkins says. When young, a gal- 
axy roils with activity, as one merger after 
another stretches and contorts it, inducing 
spurts of star formation. After a few billion 
years, the galaxy tends to settle into a rela- 
tively placid and stable middle age. Later, 
it can even slip into senescence as it loses 
its gas and the ability to make stars—a transition 
our Milky Way appears to be making 
now, Hopkins says. But the wild and violent 
turns of adolescence make the particular 
path of any galaxy hard to predict, he says. 

The simulations are far from perfect. 
They cannot come close to modeling indi- 
vidual stars—even though the simulations 
point to the importance of feedback ef- 
fects on that scale, such as the winds and 
radiation from supernovae and from galax- 
ies’ central black holes. Instead, each grid 
element or particle stands for hundreds to 
millions of solar masses of stars and gas, 
depending on the resolution of the simula- 
tion. Researchers then employ ad hoc “sub- 
grid” rules to describe how all that material 
behaves on average. “It’s like you're looking 
through foggy glasses and trying to describe 
this shape that you cannot see perfectly,” 
says Avishai Dekel, a cosmologist at the He- 
brew University of Jerusalem and a leader 
of the VELA simulation. 

Those ad hoc rules include dozens of 
parameters that researchers tune to repro- 
duce known features of the universe, such 
as the tallies of galaxies of different masses. 
That tuning raises the question of whether 
the models explain reality or merely mimic 
it, like a painting. But researchers say the 
models should be reliable as long as they 
avoid predictions that depend strongly on 
the tuning. “We're not going to get away 
from subgrid prescriptions, there’s no way,” 
Di Matteo says. “But this is not some kind of 
magic. It’s still physics.” 


THE MODELS HAVE already overturned some 
long-held assumptions. For example, astro- 
physicists believed that when two delicate 
disk galaxies like our Milky Way collide and 
merge, the process would wad them up into 
a single blobby elliptical galaxy. However, the 
models show that spiral galaxies are tougher 
than expected, if they hold enough gas. “You 
have disks partially surviving and recovering 
so quickly,” Springel says. That finding was a 
big surprise, Hopkins says. 

The usual explanation of what deter- 
mines galactic size has also been knocked 
down, says Sandra Faber, an astronomer 
at UC Santa Cruz who works with VELA. 
Astrophysicists had assumed that a gal- 
axy’s size is determined by the spin of the 
dark matter halo enveloping it, with faster- 
spinning haloes producing larger, more 
diffuse galaxies, she says. But simulations 
show no such connection, she adds. “We're 
now at a loss,” Faber says. “What makes a 
big galaxy big and a small galaxy small?” 

The shapes of newborn galaxies yield 
another surprise. Most galaxies today are 
spherical or oblate, like flattened spheres. 
Ellipticals are thick, like round cakes of 
soap; disks are much flatter. But the models 
predict that early in the universe, emerg- 
ing galaxies were prolate—longer than they 
were wide, Faber says. “They’re pickles,” she 
says. “You try to make a pickle out of gas. 
It’s not easy.” NASA’s Hubble Space Telescope 
has begun to spot examples of these 
pickle-shaped galaxies, she says. 

The models predict other subtle phenomena that observers can try to spot.
For example, astrophysicists had assumed that gas flows into a growing
galaxy equally from all directions. However, the simulations show that gas
pours into a galaxy in cold streams that flow along the dark matter
filaments connecting its halo to the cosmic web, Dekel says. Observers with
the Atacama Large Millimeter/submillimeter Array, a battery of 66 radio
dishes in Chile, have begun to peer into space for evidence of the streams.

The simulations also aim to test the basic theory of ΛCDM. By comparing
real and simulated galaxies, researchers can test the assumption that dark
matter interacts only through gravity. Any discrepancy might point to new
interactions and help particle theorists figure out what dark matter is.

None has been seen so far, but the newer simulations have patched up
mismatches between observations and earlier dark matter-only simulations.
For example, 20 years ago, those simulations spawned swarms of small dark
matter haloes around the bigger ones, which suggested that a galaxy like
our Milky Way should be surrounded by hundreds of dwarf satellite galaxies.
But only a few had been spotted. That deficit was dubbed the
missing-satellites problem.

But mix in the ordinary matter, and the predictions change. The
gravitational push and pull between dark and ordinary matter smooths things
out, reducing the number of small haloes. In those that do emerge, winds
kicked up by supernovae tend to overwhelm the halo’s relatively weak
gravitational pull and blow out the gas, starving the halo of the raw
material to make more stars and snuffing out the nascent galaxy. Couple
that process with the fact that observers have now found 59 dwarf galaxies
surrounding the Milky Way, and the disconnect between observations and
simulations largely disappears, Springel says. “I don’t see the
missing-satellites problem as a problem anymore,” he says.

Similarly, the older simulations suggested the concentration of dark matter
should peak sharply at the very center of a halo. Yet the speeds of stars
in nearby dwarf galaxies indicate that in their cores dark matter is spread
out smoothly over a larger volume. The new simulations get that detail
right because they capture how the gravitational effects of stars stir up
the dark matter and spread it out. “Even if the stars are a small fraction
of the mass, they really shake up the halo,” Hopkins says.


Simulations great and small

Some models operate at cosmic scales, whereas others generate individual, realistic-looking galaxies (left).
They divide space into volume elements or model matter as swarms of particles, then trace their interactions.

NAME          SIMULATION SIZE         NUMBER OF VOLUME               MINIMUM ELEMENT MASS  FOCUS                FIRST
              (LIGHT-YEARS)           ELEMENTS/PARTICLES             (SOLAR MASSES)                             PAPERS
Millennium    2.2 billion             10 billion                     1 billion             Dark matter only     2005
VELA          45 million              500 million                    1000                  Individual galaxies  2009
FIRE          3 million-10 million    Few hundred million-1 billion  200-2000              Individual galaxies  2014
EAGLE         80 million-325 million  100 million-7 billion          1.8 million           Cosmic evolution     2014
BlueTides     1.9 billion             700 billion                    2 million             First galaxies       2015
IllustrisTNG  110 million-1 billion   270 million-30 billion         1 million-10 million  Cosmic evolution     2018

Perhaps the simulations' single biggest les- 
son so far is not that scientists need to revise 
their overarching theory of cosmology, but 
rather that problems lurk in their understand- 
ing of astrophysics at smaller scales. In par- 
ticular, their theory of star formation comes 
up wanting, Springel says. To produce realis- 
tic galaxies, modelers must drastically reduce 
the rate at which clouds of gas form stars 
from what astrophysicists expect, he says. 
“Basically, the molecular clouds form stars 
100 times slower than you'd think,” he says. 


IMAGE: A. WETZEL, P. HOPKINS, AND THE FIRE COLLABORATION


The life stages of a galaxy

Galaxies evolved hand in hand with the large-scale structure of the
universe. After the big bang, dark matter (blue) and ordinary matter (gold)
filled space unevenly. The dark matter then began to coalesce under its own
gravity into a scaffolding of clumps and filaments known as the cosmic web.
Computer models show how ordinary matter poured into the clumps to form the
first small, irregularly shaped galaxies, which grew over time in mergers.

1 Birth
The seeds of galaxies lie in dense clumps of dark matter called haloes,
which draw in the hydrogen gas that collapses into stars.

2 Childhood
As stars turn on, the first small protogalaxies emerge, lumpy and
pickle-shaped. Streams of cold gas, flowing along threads of dark matter,
feed the galaxies and their central black holes.

3 Adolescence
The young galaxy grows through violent mergers, which trigger bursts of
star formation, even as supernovae blow out gas and limit the process.

4 Middle age
The galaxy settles down. As it ages further, radiation from the central
black hole will eventually drive out gas, bringing star formation to a halt.

Figure labels: original distribution of dark and ordinary matter; dark
matter halo; protogalaxies; cosmic web; stream of gas; central black holes;
supernova; supermassive black hole; satellite galaxy; present distribution
of dark and ordinary matter.
Timeline labels: 10 million years after the big bang; 500 million years;
6 billion years; present (13.8 billion years).


Most likely, star formation flags because feedbacks from supernovae and
supermassive black holes drive gas out of a galaxy. Unfortunately, those
processes are far too small to resolve in the simulations. When modelers
deposit the energy of a supernova in a larger grid element, not much
happens: Instead of generating wind, the energy just radiates away.
Similarly, researchers cannot simulate the fitful way that black holes feed
on gas and radiate x-rays. To capture these key bits of astrophysics,
modelers must rely on the ad hoc subgrid prescriptions that they tune by
hand.

Simulators hope to replace such crude assumptions with models based more
solidly on physics. To do that, they’re hoping to enlist the help of
astrophysicists working on much more finely resolved models that simulate
the birth of stars from molecular clouds just a few light-years wide and
even the evolution of individual stars. Those smaller-scale models are
themselves works in progress. For example, astrophysicists modeling
supernova explosions still struggle to make their virtual stellar time
bombs go off.

Nevertheless, Eve Ostriker, an astrophysicist at Princeton University who
models interstellar gas, says she’s eager to help put galaxy simulations on
a sounder footing. “My interest in this is to replace the tuning with some
physics and say, ‘OK, this is what it is, no tuning allowed,’” she says.
The hope is to string together results from different size scales in a way
that minimizes the need for fudge factors, researchers say. “What you want
is a picture that’s coherently stitching together across the entire range
of scales,” Hopkins says.


Ultimately, through observations and 
simulations, some researchers still hope to 
develop a unified narrative that can explain 
how any galaxy gets its shape and properties. 
Taking an extreme position, Faber predicts 
all galaxies will ultimately be sorted and ex- 
plained by just two parameters: mass and 
radius. “There’s a galaxy law that we’re only 
now discovering that makes it simple.” 

But many galaxy modelers believe the recipes 
will always be complicated and uncertain. 
Galaxy formation may be like the weather, 
which keeps precise predictions forever 
out of reach because of its chaotic nature, 
Springel says. “I’m a little bit concerned 
that we’ll understand the big picture but 
never understand the details,” he says. In 
that case, the increasing realism of galaxy 
simulations may serve only to underscore a 
fundamental complexity in the universe. 

BRIDGE: PESTICIDES AND SOCIETY 


Decision-making in a storm of discontent 


Regulation of pesticides such as glyphosate needs to include societal assessment 


By Nico M. van Straalen 
and Juliette Legler 


On 12 December 2017, the European 
Commission reapproved the use of 
glyphosate, the world’s most widely 
used active ingredient in herbicides 
and possibly the most heavily debated 
plant protection product since 
DDT (dichlorodiphenyltrichloroethane), for 
another 5-year period (1). Less than a week 
later, the U.S. Environmental Protection 
Agency (EPA) released its draft risk assess- 
ment for glyphosate, which concluded that 
human health risk levels associated with 
glyphosate exposure from food, drinking 
water, and residential sources are below 
the agency’s levels of concern (2). Both in 
Europe and the United States, these deci- 
sions faced a storm of public discontent 
owing to concerns about the possible risks 
of chemical exposures and the role of large 
multinational companies. We argue that a 
broader societal assessment should be included 
in the decision-making process on 
pesticide registration. 


IS GLYPHOSATE SAFE OR NOT? 

The European Commission’s decision to reap- 
prove glyphosate was taken after lengthy and 
recurrent reviews by the European Chemi- 
cals Agency (ECHA), the European Food 
Safety Authority (EFSA), and the responsible 
ministries in member states of the European 
Union (EU). It was in line with conclusions 
reached by the Food and Agriculture Orga- 
nization of the United Nations (FAO)-World 
Health Organization (WHO) Joint Meeting 
on Pesticide Residues on the carcinogenic 
risk to humans from dietary exposure (3). It 
was not, however, in line with the Interna- 
tional Agency for Research on Cancer (IARC) 
of the WHO, which classified glyphosate as 
a probable human carcinogen in 2015 and 
again in an updated review in 2017 (4). The 
discrepancy between IARC’s conclusions and 
the EU, FAO-WHO, and EPA assessments has 
been attributed to the use of different data- 
sets and methods to evaluate the data (5), 
which points to a lack of international stan- 
dardization of risk assessment procedures. 

Prior to the European Commission’s deci- 
sion and that of the EPA, nongovernmental 
organizations (NGOs) had fought hard for 
years to curb the use of glyphosate-based 
herbicides. This ongoing plea has found 
support in an increasing number of sci- 
entific peer-reviewed articles, which have 
argued that glyphosate may have more 
profound ecotoxicological effects than con- 
cluded by regulatory authorities (6, 7). 

In the EU, the campaign to end the license 
of glyphosate culminated in a European 
Citizens’ Initiative that received over 1 million 
verified statements of support from 22 
EU member states and was submitted to the 
European Commission on 6 October 2017 
(8). The European Commission defended 
its position, arguing that “there are neither 
scientific nor legal grounds to justify a ban 
of glyphosate” (9). However, its decision to 
renew the license of glyphosate for 5 years, 
rather than the more common 10 or 15 years, 
can be viewed as a compromise, requiring 
the European Commission to revisit the ap- 
proval of glyphosate on a shorter-than-usual 
time scale. 


EXPLAINING THE DIVERGENT VIEWS 

There are several reasons for the large dis- 
crepancy of views between regulatory au- 
thorities (except the IARC) on the one hand 
and some academic groups, several NGOs, 
and a substantial part of the general public 
on the other. 

The glyphosate case is inherently com- 
plicated. Glyphosate inhibits a pathway of 
aromatic amino acid synthesis that is only 
present in plants, fungi, and bacteria and 
was therefore not expected to cause toxicity 
to animals (7). However, glyphosate is 
an active ingredient in many different herbicide 
formulations. In addition to the active 
ingredient, formulated herbicides such 
as Roundup contain additional chemicals, 
called adjuvants, that improve herbicidal 
activity or application characteristics. The 
toxicity of several Roundup formulations to 
human cell lines has been mainly attributed 
to such adjuvants rather than to glyphosate 
itself (10). The same is true for effects on 
ecological targets, although a smaller but 
separate effect of glyphosate cannot be 
excluded in these studies (11). 

Glyphosate-based herbicide is sprayed to control 
weeds in an asparagus field in Brandenburg, Germany. 

Thus, many of the claims for glyphosate 
toxicity in the literature may be due to ef- 
fects of one or more adjuvants, acting ei- 
ther directly or through interaction with 
glyphosate (7). Adjuvants are 
authorized under European leg- 
islation by including them on 
the “Official List of Adjuvants,” 
but in the past these substances 
did not undergo a risk assess- 
ment comparable to the active 
ingredient. In its act on glypho- 
sate (1), the European Com- 
mission has now forbidden the 
use of polyoxyethylene tallow 
amines as adjuvants in plant 
protection products. 

Another issue complicating 
decision-making on glyphosate-based 
herbicides is that 
their use is interwoven with 
the planting of genetically 
modified (GM) glyphosate-resistant crop 
varieties. The global use of glyphosate 
has increased 15-fold since the introduc- 
tion of GM crops in 1996 (12). Glyphosate- 
resistant GM crops are presently not grown 
in Europe, but their widespread culture 
in the United States and South American 
countries has resulted in appreciable resi- 
dues of glyphosate on plant products, such 
as soybeans, that are imported into Europe 
and used for food and feed production. 
This is aggravated by the use of glyphosate 
as a desiccant to kill foliage before harvest 
(6). The glyphosate residues are generally 
in compliance with maximum residue lev- 
els (MRLs) set by the EFSA; however, when 
the EFSA raised the MRL for glyphosate in 
lentils from 0.1 to 10 mg/kg in 2012, suspi- 
cion arose that thresholds principally set 
to protect consumers and husbandry ani- 
mals were adjusted to accommodate agri- 
cultural use of pesticides (6). 

The widespread and rising use of glypho- 
sate worldwide is also in itself a concern. 
Estimated global use was ~826 million kg 
in 2014, up from ~56 million kg in 1994 (12). 
A recent study found residues of glyphosate 
and/or its main metabolite, aminomethylphosphonic 
acid (AMPA), in 45% of European 
topsoils (13). The migration of herbicide 
residues into groundwater and their 
possible spread to drinking water extraction 
facilities is a serious concern. In addition, 
the effects of glyphosate on soil microbial 
communities and invertebrates in soils re- 
main largely undocumented and constitute 
a substantial knowledge gap. 
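
The growth figures quoted above can be checked with a few lines of arithmetic. The sketch below, in Python, uses only the two estimates given in the text (~56 million kg in 1994 and ~826 million kg in 2014). The function and constant names are illustrative, and the average yearly rate assumes smooth exponential growth, which is a simplification rather than a claim about the underlying data.

```python
def fold_change(start_kg: float, end_kg: float) -> float:
    """Ratio of the later amount to the earlier amount."""
    return end_kg / start_kg


def compound_annual_growth(start_kg: float, end_kg: float, years: int) -> float:
    """Constant yearly growth rate that would turn start_kg into end_kg.

    Assumes smooth exponential growth, which is an idealization.
    """
    return (end_kg / start_kg) ** (1.0 / years) - 1.0


# Estimates quoted in the text (12): ~56 million kg in 1994, ~826 million kg in 2014.
USE_1994_KG = 56e6
USE_2014_KG = 826e6

if __name__ == "__main__":
    ratio = fold_change(USE_1994_KG, USE_2014_KG)
    rate = compound_annual_growth(USE_1994_KG, USE_2014_KG, 2014 - 1994)
    print(f"Fold change 1994-2014: {ratio:.2f}")
    print(f"Implied average growth per year: {rate:.1%}")
```

The ratio comes out just under 15, which is of the same order as the 15-fold rise cited earlier, although that statement is tied to the introduction of GM crops in 1996 rather than to 1994.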


SOCIOECONOMIC ANALYSIS 

Apart from the inherent scientific complex- 
ity, the glyphosate case illustrates a funda- 
mental societal issue. The mere fact that 
the European Citizens’ Initiative attracted 
so many adherents is indicative of a wide 
societal aversion to the massive increase in 
the production of chemicals and their use in 
pesticide-based agricultural mass production. 
Public concern is not limited to 
glyphosate but also covers other chemicals 
such as neonicotinoids, endocrine-disrupting 
compounds, and food additives. 

Activists demand a stop to glyphosate use in the European Union 
as they protest in Madrid, Spain, on 8 February 2017. 

The European Registration, Evaluation, 
Authorisation, and Restriction of Chemicals 
(REACH) regulation was adopted in 2007 
to improve the protection of human health 
and the environment from possible risks 
posed by chemicals other than pesticides, 
food additives, and drugs. The scheme re- 
quires a socioeconomic impact analysis as 
part of the authorization and restriction 
process (14). In this analysis, conducted by 
the ECHA’s Committee for Socio-Economic 
Analysis (SEAC), the socioeconomic ben- 
efits of authorization or restriction are 
weighed against the associated risks to hu- 
man health or the environment. A recent 
evaluation of the SEAC’s work showed that 
the ECHA’s experts struggle with estimat- 
ing the societal benefits and determining 
what costs involved in restricting the use of 
a chemical are proportional to the risks that 
are avoided through such restriction (15). 
However, the scientific basis and the meth- 
odologies for performing socioeconomic 
analysis are improving (14). 

Presently there is no societal assessment 
in pesticide registration. Recently, a promis- 
ing framework for the combination of cost- 
benefit analyses with factors such as risk 
perception, uncertainty, and trust in regu- 
latory decision-making on toxic substances 
in food, including pesticides, has been pro- 
posed (16). We argue that including such a 
framework in pesticide authorization would 
be an appropriate way to take factors such 
as citizens’ initiatives, societal attitudes to- 
ward agricultural chemicals, and economic 
benefits of chemical pest and weed control 
into account. 

In a recent commentary, EFSA’s executive 
director, Bernhard Url, called for a broader 
societal discussion about the role of modern 
agricultural practices to be “framed ahead 
of and outside scientific work” (17). We be- 
lieve, however, that integration rather than 
a separate assessment is the way forward 
to move the discussion away from the present 
binary decision of whether an individual 
pesticide is safe or unsafe. It is time 
for a new scheme for pesticide evaluation 
in which regulatory decision-making takes 
into account not only the technical evidence 
on safety but also the societal context in 
which decisions are made. 

Dunes across the Solar System 


Despite a very thin atmosphere, dunes may form on Pluto 


By Alexander G. Hayes 


Before NASA’s New Horizons mission, 
the surface of Pluto was shrouded in 
mystery. No one knew what to expect 
from its surface and most scientists 
shied away from detailed specula- 
tion, except to say that the one thing 
we should expect is to be surprised (1). On 
page 992 of this issue, Telfer et al. (2) pres- 
ent such a surprise by describing features 
interpreted as dunes on Pluto’s surface. 

An eloquent, although perhaps dour, 
summary of the pre-New Horizons view of 
Pluto was given by the amateur astronomer 
Sir Patrick Moore in his 1955 book, Guide 
to the Planets: “Beyond all doubt, Pluto is 
the loneliest and most isolated world in 
the Solar System—cut off from its fellows, 
plunged in everlasting dusk, silent, barren, 
and touched with the chill of death. Nature 
seems to have passed it by, and it can never 
have known the breath of life. It marks the 
frontier of the Sun’s kingdom” (3). 

If Sir Patrick were with us today, New 
Horizons might have changed his opinion. 
Although Pluto is indeed isolated, nature 
has in no way passed it by. Images from 
New Horizons have revealed a geologically 
diverse and dynamic world driven by inter- 
nal heat, extreme seasons, and sublimating 
ices (4). Rather than representing the end 
of the Sun’s kingdom, Pluto perhaps repre- 
sents the gateway to an unexplored realm 
where volatile ices (consisting, for example, 
of nitrogen, methane, ammonia, and car- 
bon dioxide) dominate the landscapes of 
the dwarf planets and irregular bodies that 
make up the Kuiper Belt. New Horizons will 
provide a close-up view of this realm on 
New Year’s Day 2019, when it passes within 
3500 km of Kuiper Belt object 2014 MU69. 

In this image of Pluto’s surface, the red luminance 
corresponds to the infrared data acquired by the 
Ralph/MVIC instrument carried by New Horizons. 

In the latest discovery from New Horizons, 
Telfer et al. describe a collection of 
regularly spaced, linear ridges on Pluto’s 
surface. They interpret these ridges as 
transverse dunes of granular methane ice. 
The features are oriented orthogonal to 
dark wind streaks on Sputnik Planitia, a 
vast plain of nitrogen ice near Al-Idrisi Montes, 
a collection of large water-ice blocks 
with up to 5 km of relief. With this discovery, 
Pluto joins Earth, Mars, Venus, Titan 
(a moon of Saturn), and perhaps even the 
comet 67P/Churyumov-Gerasimenko (5), 
demonstrating that the mobilization and 
self-organization of granular material into 
dunes occur throughout the Solar System 
(see the figure). What makes this discovery 
surprising is that the sediment can be mobilized 
despite Pluto’s tenuous atmosphere, 
with a surface pressure (1 Pa) that is 
100,000 times lower than that on Earth. 
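
As a rough check on the pressure comparison above, the following Python sketch divides Earth's standard sea-level pressure (101,325 Pa, the standard atmosphere) by the roughly 1 Pa quoted for Pluto. The constant and function names are illustrative, and the Earth value is a reference figure brought in for the comparison, not a number from the text.

```python
import math

EARTH_SURFACE_PRESSURE_PA = 101_325.0  # standard atmosphere at sea level (assumed reference)
PLUTO_SURFACE_PRESSURE_PA = 1.0        # value quoted in the text


def pressure_ratio(reference_pa: float, target_pa: float) -> float:
    """How many times larger the reference pressure is than the target."""
    return reference_pa / target_pa


if __name__ == "__main__":
    ratio = pressure_ratio(EARTH_SURFACE_PRESSURE_PA, PLUTO_SURFACE_PRESSURE_PA)
    print(f"Earth/Pluto surface pressure ratio: {ratio:,.0f}")
    print(f"Orders of magnitude: {math.log10(ratio):.1f}")
```

The ratio is about five orders of magnitude, in line with the figure of 100,000 given in the text.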

The processes by which grains are mobi- 
lized in a fluid medium have long been stud- 
ied in both planetary science and terrestrial 
sedimentology. One of the earliest papers 
addressing this problem in a tenuous atmo- 
sphere was written in 1975 by Carl Sagan 
and Ralph Alger Bagnold, who investigated 
sediment transport on Mars (6). Despite the 
tremendous progress that has been made 
on the subject (7), many puzzles remain. 
Typical surface winds on Pluto are expected 
to be sufficient to transport particles once 
lofted, but even the strongest winds driven 
by interactions with the nearby Al-Idrisi 
Montes are not expected to be sufficient to 
begin lofting grains from rest. 

To overcome this problem, Telfer et al. 
suggest that motion is initiated by the 
sublimation of nitrogen ice from Sputnik 
Planitia itself. The idea of sublimation-induced 
lofting has previously been proposed 
to explain the erosional CO₂ troughs 
that form spiderlike structures at the south 
pole of Mars (8) and the dark streaks on 
Neptune’s moon Triton (9). In contrast to 
these previous studies, Telfer et al. inti- 
mately tie sublimation to dune formation 
in a tenuous atmosphere. If an extremely 
tenuous atmosphere like that of Pluto can 
support the generation of bedforms from 
wind-driven sediment, what kind of eolian 
activity might we see on places like Io (a 
moon of Jupiter) or Triton? 

Despite vast differences in gravity, atmo- 
spheric pressure, and sediment type, wind- 
blown sand dunes on Earth, Mars, Venus 
(10), and Titan self-organize into remark- 
ably similar landscape patterns (see the 
figure) (11). This property of dune pattern 
formation aided Telfer et al. and Jia et al. (5) 
in identifying candidate dunefields on Pluto 
and comet 67P, respectively. However, inter- 
preting formative mechanisms and envi- 
ronmental conditions from well-organized 
patterns alone can be hazardous. Nature 
tends to converge toward a set of relatively 
few forms and generic patterns using a va- 
riety of processes (12). Accordingly, much 
work is left to do to understand dunes on 
Pluto. Most notably, it remains to be shown 
how high the dunes are, when they are most 
active, whether they change, and whether 
entrainment can occur without lofting (5). 

Regardless, Telfer et al. convincingly argue 
that lofted granular material entrained 
in a flow will self-organize into patterns 
and—given the right flow regime, deter- 
mined by the particle Reynolds number 
and Shields stress (13)—those patterns will 
be recognizable as dunes. With the require- 
ments of a thicker atmosphere for dune 
formation relaxed, it is perhaps not that 
surprising that dunes appear throughout 
the Solar System, so long as there is sedi- 
ment and a mechanism to mobilize it. 
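
The two dimensionless numbers named above can be written out explicitly. The Python sketch below uses the common textbook definitions (bed shear stress tau = rho_f * u_*^2, Shields number theta = tau / ((rho_s - rho_f) g d), and particle Reynolds number Re_p = u_* d / nu). Whether reference (13) uses exactly these forms is an assumption, and the sample values are generic Earth-like sand-in-air numbers chosen only for illustration, not measurements for Pluto.

```python
def shear_stress(fluid_density: float, shear_velocity: float) -> float:
    """Bed shear stress tau = rho_f * u_*^2, in Pa."""
    return fluid_density * shear_velocity ** 2


def shields_number(tau: float, grain_density: float, fluid_density: float,
                   gravity: float, grain_diameter: float) -> float:
    """Dimensionless Shields stress: fluid drag relative to grain weight."""
    return tau / ((grain_density - fluid_density) * gravity * grain_diameter)


def particle_reynolds_number(shear_velocity: float, grain_diameter: float,
                             kinematic_viscosity: float) -> float:
    """Re_p = u_* d / nu: inertial relative to viscous forces at grain scale."""
    return shear_velocity * grain_diameter / kinematic_viscosity


if __name__ == "__main__":
    # Illustrative values only (hypothetical, roughly quartz sand in air on Earth).
    rho_f, rho_s = 1.2, 2650.0   # fluid and grain density, kg/m^3
    g = 9.81                     # gravitational acceleration, m/s^2
    d = 2e-4                     # grain diameter, m
    u_star = 0.3                 # shear velocity, m/s
    nu = 1.5e-5                  # kinematic viscosity, m^2/s

    tau = shear_stress(rho_f, u_star)
    print("Shields number:", shields_number(tau, rho_s, rho_f, g, d))
    print("Particle Reynolds number:", particle_reynolds_number(u_star, d, nu))
```

Swapping in values appropriate to another body changes only the inputs, which is one way to see why similar patterns can arise under very different gravity, pressure, and sediment type.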

Dunes everywhere 
Similar dune-like patterns have been found on Solar System bodies including planets, moons, and a comet that all have very different gravity, 
atmospheric pressures, and sediment types. 

Linear star dunes formed by changing winds in the 
Rub’ al Khali sand desert on the Arabian Peninsula. 

Linear dunes diverted around topographic features in 
the Belet dunefield on Titan, Saturn's largest moon. 

A crescent-shaped dune with wind ripples in Nili Patera, 
one of the most active dunefields on Mars. 

Churyumov-Gerasimenko created by thermal winds. 

Slowly changing dunes formed by gentle surface winds in 
the Fortuna-Meshkenet dunefield on Venus. 

Surface features interpreted as dunes in Sputnik Planitia, 
a vast plain of nitrogen ice on Pluto. 


CANCER METABOLISM 


The nutrient environment affects therapy 


Nutrient availability affects cancer cell metabolism and therapeutic responses 


By Alexander Muir¹ and 
Matthew G. Vander Heiden¹,² 


Understanding the molecular basis 
of cancer has led to a revolution 
in how cancers are classified and 
treated. Subsets of patients benefit 
from precision-medicine drugs that 
target growth-promoting signaling 
networks, but not all cancer patients respond 
to these approaches (1). Precision 
medicine is largely built on the assump- 
tion that cancer cell-intrinsic factors, such 
as genetic mutations or epigenetic identity, 
determine which pathways and processes 
are required in cells and thus response to 
therapies. However, in many cases, the presence 
of particular genetic lesions is insufficient 
to identify patients that will respond 
to a drug (1). For instance, standard 
cell culture models have not 
identified the subsets of cancer 
patients that respond to most 
conventional chemotherapies 
(1). Nevertheless, these chemo- 
therapy drugs remain standard 
of care for many cancers and, 
in some cases, contribute to cu- 
rative regimens. Emerging data 
suggest that beyond cell-intrinsic factors, 
nutrient availability in the tumor micro- 
environment can also influence drug re- 
sponse. This highlights the importance of 
understanding the microenvironmental 
factors that dictate which cellular processes 
are essential for disease progression and ul- 
timately how tumors respond to treatments 
that target these processes. 

The success of conventional chemother- 
apy to treat patients with cancer argues that 
drugs that target cell metabolism and pro- 
liferative machinery, such as antimetabolite 
chemotherapy, can be effective. Recent ef- 
forts to target cancer metabolism have fo- 
cused on how different cell-intrinsic factors 
such as oncogenic mutations rewire metabo- 
lism to require cells to use specific metabolic 
pathways and nutrients for growth and sur- 
vival (2). There have been clinical successes 
from this approach, such as targeting mu- 
tant isocitrate dehydrogenases. However, the 
development of drugs that target enzymes in 
core metabolic pathways, such as those that 
metabolize glucose, has been challenging. 
Even though profound metabolic alterations 
are observed broadly in cancer, an inability 
to match the right patients with specific 
drugs has limited therapeutic development 
of new molecules that target metabolism. 

¹Koch Institute for Integrative Cancer Research and the 
Department of Biology, Massachusetts Institute of Technology, 
Cambridge, MA, USA. ²Dana-Farber Cancer Institute, Boston, 
MA, USA. Email: mvh@mit.edu 

962 1 JUNE 2018 • VOL 360 ISSUE 6392 

Experiments in microorganisms have 
demonstrated that gene function and essentiality 
for proliferation and survival are largely 
environment dependent (3). That is, the avail- 
ability of nutrients in the environment can 
affect whether microorganisms require cer- 
tain genes to thrive. By extension, the tumor 
microenvironment may alter the essentiality 
of pathways in cancer cells and influence 
sensitivity to drugs ranging from classic che- 
motherapies to new, targeted agents (see the 
figure). Recent studies have demonstrated 
that microenvironmental factors have a profound 
influence on cancer cell metabolism 
and sensitivity to drugs that target metabo- 
lism. For example, biochemical analysis of 
both mouse and human tumors revealed 
different nutrient preferences for lung and 
brain tumors in vivo compared with cancer 
cells cultured from these tumors. In cultured 
lung and brain cancer cells, glutamine is 
used as a primary carbon source for meta- 
bolic pathways needed to synthesize mac- 
romolecules for growth, whereas glutamine 
catabolism can be less important in tumors 
formed from the same cells in mice (2). Fur- 
thermore, genetic screens of human cells in 
culture and in xenograft tumors have yielded 
discordant results with respect to metabolic 
gene essentiality (4), lending further support 
to the idea that cancer cells require differ- 
ent metabolic processes, depending on their 
microenvironment. The microenvironment 
can also affect drug response. For example, 
inhibitors of the enzyme glutaminase slow 
the proliferation of most cancer cells in cul- 
ture, but this has not been predictive of tu- 
mor response to glutaminase inhibitors in 
either patients or mouse cancer models (2). 
Similarly, drugs targeting mammalian target 
of rapamycin (mTOR), a protein involved in 
nutrient sensing and metabolic regulation, 
are much more successful at limiting cell 
growth and proliferation in cultured cells 
than they are at slowing tumor growth in 
mouse cancer models and in patients (5). 
These observations argue that the tumor mi- 
croenvironment affects cancer cell metabo- 
lism and can alter drug sensitivity. 

Standard cell culture models of cancer 
do not mimic the tumor microenvironment. 
One major difference between classical cell 
culture conditions, where most drugs are 
initially tested, and tumors is the level of 
available nutrients. Indeed, most culture 
media formulations were not intended to 
mimic tumor physiology but were instead 
derived from experiments to identify the 
minimal nutrients required to 
grow mammalian cells in cul- 
ture dishes (6). Levels of oxygen, 
which affect cancer cell metabo- 
lism and therapy response, have 
long been appreciated to be 
nonphysiologically elevated in 
standard culture conditions (7). 
Concentrations of other nutri- 
ents are different in tumors in 
vivo than in standard culture media, which 
alters cell metabolism and affects therapy 
response. For example, glioblastoma cells 
cultured in media that contains nutrients 
at physiological levels found in blood rely, 
to a lesser extent, on glutamine consump- 
tion for proliferation than the same cells 
in standard culture media (8). Importantly, 
glioblastoma tumors in mouse models in 
vivo produce rather than catabolize gluta- 
mine, and this difference has been attrib- 
uted to artificial nutrient levels in standard 
culture media creating a nonphysiological 
state with respect to glutamine metabo- 
lism. Similarly, culturing lung cancer cells 
in medium containing nutrients at physi- 
ological levels also reduces the dependence 
on glutamine catabolism and reproduces 
the lack of glutaminase inhibitor sensitivity 
observed in vivo (9). Many cell culture for- 
mulations contain high levels of the amino 
acid cystine, which drives glutamine catab- 
olism and sensitivity to glutaminase inhibi- 
tors. Cancer cells in tumors are exposed to 
lower cystine levels, which explains at least 
in part why glutaminase inhibitors are less 
effective at slowing the growth of tumors in
mice derived from cells that are sensitive to
these drugs in culture. Thus, nutrient levels
are an important component of the tumor
microenvironment that alters metabolism
and drug responses.

Environmental nutrient levels can also
alter the requirement for “recycling
metabolism.” In culture, intracellular protein
recycling to obtain amino acids (10) and
recapture of acetate from histone
modifications (11) are dispensable for cancer
cell proliferation, but both become required
for tumor growth in vivo, where many
nutrients are more limiting. Catabolism of
extracellular protein can also be used by
cells as a source of amino acids (10). Most
media formulations are protein deficient,
but otherwise amino acid rich, which may
limit protein catabolism. However, cells
grown with limiting amino acids but
physiological levels of the protein albumin,
the most abundant protein in blood and
tissues, rely on albumin to obtain amino acids
for growth (5). Interestingly, this switch in
nutrient acquisition affects drug response.
mTOR signaling promotes growth under
amino acid replete conditions but constrains
protein catabolism. Therefore, cells
relying on extracellular protein catabolism
for growth are resistant to mTOR inhibition
(5). Thus, differences in amino acid
acquisition between tumor cells and cells in
culture may contribute to the limited efficacy
of mTOR inhibitors in the clinic.

Cancer cells in tumors have access to
nutrients that are not always added to
standard cell culture media. For instance, the
presence of uric acid, a nucleotide
breakdown product found in vivo but absent
from cell culture media, renders cells
resistant to the pyrimidine analog and
chemotherapeutic drug 5-fluorouracil, which is a
standard treatment for many cancers (12).
Furthermore, the addition of pyruvate to
cell culture media alters the cellular redox
state, which limits the ability of the
antidiabetic drug metformin (which may also
benefit some patients with cancer) to slow
cancer cell proliferation (13). The altered
sensitivity to these widely used drugs
illustrates that microenvironmental nutrient
levels can affect metabolic pathway use and
response to metabolism-targeted therapy.

The microenvironment influences phenotype

Cell-intrinsic factors, such as cell lineage and genetic mutations, define a metabolic network that the cell is
capable of using. However, the way this network operates is constrained by available nutrients and stromal
interactions in the microenvironment. Thus, identical cells in different metabolic microenvironments exhibit
distinct metabolic programs and variation in phenotypes, such as drug response.

[Figure: Environment A and Environment B differ in conditions (O2 or CO2), nutrient availability, and stromal
interactions. Cells use metabolic pathway A or metabolic pathway B, and the same drug yields phenotype A or
phenotype B. Nutrient availability has a major influence on cancer cell metabolic phenotypes: proliferation,
nutrient utilization, drug sensitivity, and metabolic pathway usage.]
GRAPHIC: C. BICKEL/SCIENCE

The tumor microenvironment also con- 
tains numerous cell types that can interact 
metabolically with cancer cells. Noncancer 
cells within a tumor can share metabolites 
with cancer cells, compete with cancer cells 
for nutrients, and provide signals that alter 
cancer cell metabolic pathway utilization. 
For example, competition between cancer 
cells and infiltrating lymphocytes for limited 
nutrients, and secretion of metabolic by- 
products by cancer cells, can create an immu- 
nosuppressive microenvironment that limits 
antitumor immune responses (J4). Thus, 
understanding the tumor microenviron- 
ment may even lead to improved therapies 
that control cancer via noncell autonomous 
mechanisms such as immunotherapy. 
Relying on in vivo models to identify 
cancer targets is impractical, but current 
scalable ex vivo models are inaccurate with 
respect to microenvironment. Microenviron- 
mental factors clearly alter drug responses, 
and differences in nutrient levels between 
tumors and normal tissues could even drive 
targetable liabilities of cancer cells. Different 
nutrient levels across tissues might also in- 
fluence where specific cancers can thrive as 
metastases. Therefore, identifying how nutri- 
ents vary in tissue and tumor microenviron- 
ments and efforts to model this in tractable 
culture systems will be crucial to identify 
patients likely to respond to both existing 
and new drugs that target cancer processes, 
such as metabolism. Tumor nutrient levels 
also fluctuate both spatially and temporally 
within an individual tumor and could be af- 
fected by additional factors such as diet. This 
metabolic heterogeneity could be an impor- 
tant component of tumor heterogeneity that 
limits therapeutic effectiveness. Thus, defin- 
ing physiological levels of nutrients and how 
physiology constrains cell metabolic pro- 
cesses could lead to a better understanding 
of drug responses and uncover new thera- 
peutic opportunities. By combining knowl- 
edge of the nutritional microenvironment 
with models that consider other features of 
the tumor microenvironment, such as organ- 
oid culture, ex vivo models could be gener- 
ated that might have better predictive power 
for drug response in patients with cancer, as 
well as with other diseases (15). Ultimately, 
redefining culture conditions to incorporate 
knowledge of the tumor microenvironment 
could transform our understanding of cell 
physiology and eliminate a bottleneck in 
identifying new cancer therapeutics. 
GENETICS 


Ancient human genomes—keys 
to understanding our past 


Ancient genomes from different times and continents are 
helping to understand past human migrations 


By Alessandro Achilli, Anna Olivieri, 
Ornella Semino, Antonio Torroni 


In 2000, when the term “archaeogenetics”
(1) was initially proposed to describe
the study of the human past by using the
techniques of molecular genetics, ancient
DNA studies were in their infancy. At
that time, DNA data samples provided
by geneticists to archaeologists, anthropolo- 
gists, and linguists for archaeoge- 
netic studies were almost entirely 
derived from living populations, with 
a few ancient DNA samples mainly 
restricted to mitochondrial DNA 
(mtDNA, or mitogenome). However, 
it was already clear that ancient DNA 
could play a much greater role once a 
number of major technical problems 
were resolved. Two studies in this is- 
sue exemplify the rapid progress that 
has been made since then. On page 
1028, Ebenesersdottir et al. (2) report 
the genomes of the first Icelanders, 
and on page 1024, Scheib et al. (3) use 
ancient Native American genomes to 
reconstruct the first phases of the hu- 
man spread in the Americas. 

In the early years of this century, 
no one anticipated the extent and 
speed at which ancient genomes and new in- 
ferences concerning prehistoric demographic 
events and historical accounts would be pub- 
lished. Moreover, ancient genomes are no 
longer retrieved solely from geographic re- 
gions with climate conditions that allow DNA 
to be well preserved, but are now recovered 
from sites all over the world. Today, we are in 
a second phase of the discipline initially
baptized by Renfrew and Boyle (1), a phase that
could be more accurately defined as the era 
of archaeogenomics, as exemplified by the 
two papers in this issue (2, 3) and by other 
very recent studies (see the figure) (4, 5). 


THE POWER OF GENETIC DRIFT 

Ebenesersdottir et al. analyzed 27 genomes 
from skeletal remains excavated at multiple 
sites in Iceland. This European island was 
peopled in the second half of the 9th century
by Vikings and their slaves who were
mostly from Norway and the British-Irish 
Isles. The skeletal remains were selected 
for DNA studies because archaeological and 
radiocarbon dates indicated that they were 
from the earliest settlers of Iceland. The ge- 
nomes show that first-generation settlers 
were mainly unmixed, coming from either 
Norse or Gaelic sources (see the photo). This
finding was supported by the study of
strontium 87 and 86 isotope ratios in the settlers’
tooth enamel; these ratios reflect the place of
childhood residence through diet.

The skeletal remains of a male (SSG-A3) with grave goods. This individual
was a first-generation Icelandic settler estimated to have an equal mix of
Norse and Gaelic ancestry.

Comparison with modern genomes 
yielded an unexpected finding. The ancient 
settlers’ genomes are more similar to those
of the modern populations of Scandinavia
and the British-Irish Isles than to those
of modern Icelanders. This observation
highlights the power of genetic drift on a 
population that, for almost the entire first 
millennium after its foundation, rarely 
counted more than 50,000 people. 
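
The effect of population size on drift can be illustrated with a minimal Wright-Fisher sketch. This is not the analysis performed by Ebenesersdottir et al.; the function name, the population sizes, the starting frequency, and the figure of 40 generations (roughly a millennium at an assumed generation time of 25 to 30 years) are all illustrative assumptions. The effective size of a population is also typically smaller than its census size.

```python
import random


def wright_fisher(p0, n_individuals, generations, rng):
    """Return allele frequencies per generation under pure drift.

    Assumes a diploid population of constant size with no selection,
    mutation, or migration (a deliberately simplified model).
    """
    n_copies = 2 * n_individuals  # two gene copies per diploid individual
    freqs = [p0]
    p = p0
    for _ in range(generations):
        # Each gene copy in the next generation is drawn at random
        # from the current gene pool.
        carriers = sum(1 for _ in range(n_copies) if rng.random() < p)
        p = carriers / n_copies
        freqs.append(p)
    return freqs


rng = random.Random(1)  # fixed seed so a run can be repeated
small = wright_fisher(0.5, 50, 40, rng)      # hypothetical small population
large = wright_fisher(0.5, 5_000, 40, rng)   # hypothetical larger population
print(f"final frequency, N=50:    {small[-1]:.3f}")
print(f"final frequency, N=5,000: {large[-1]:.3f}")
```

Over repeated runs with different seeds, the frequency in the small population tends to wander much farther from its starting value than in the large one, which is the sense in which drift is stronger when few people contribute to each generation.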


ANCIENT GENOMES FROM AFRICA 

In a recent study, van de Loosdrecht et al. (4)
reported genomic data from seven individuals
from Grotte des Pigeons near Taforalt in
als from Grotte des Pigeons near Taforalt in 
eastern Morocco. These individuals are as- 
sociated with the Later Stone Age Iberomau- 
rusian culture and lived between 15,100 and 
13,900 years ago. Their genomes are the most 
ancient analyzed so far from Africa, a techni- 
cal success that overcomes previous limita- 
tions in obtaining ancient DNA from tropical 
or subtropical regions. The most likely reason 
is that all Taforalt DNAs were extracted from 
petrous bones, where endogenous DNA is 
best preserved. 

Comparison with genomes from other 
ancient samples and modern populations 
reveals three major ancestry components 
that best characterize the Taforalt genomes: 
early-Holocene Levantines (Natufians), West 
Africans, and East African hunter-gatherers 
(Hadza). The first component indicates an 
important pre-Neolithic genetic input from 
the Near East, whereas the other two indicate 
a substantial input from sub-Saharan Africa, 
a contribution that is much less evident in 
modern North Africans. No gene flow signals 
from Paleolithic Europeans were detected, 
directly and definitively refuting a previously 
proposed European origin for the 
Iberomaurusian culture. 

As a single, maternally inherited 
locus, mtDNA often does not reflect 
the whole complexity of demographic 
events and is also particularly prone 
to genetic drift. However, after refine- 
ments in its evolution rate (6, 7) and 
the nesting relationships within its 
phylogeny, mtDNA can now provide 
rather narrow time boundaries for 
dating specific demographic events. 
All seven Taforalt specimens belong 
to two North African branches (hap- 
logroups) of the mtDNA tree: Six 
belong to mitochondrial haplogroup 
U6, and one belongs to M1. These 
two haplogroups are those proposed 
12 years ago, on the basis of mod- 
ern DNA data, as markers of an Up- 
per Paleolithic migration from the Levant 
to North Africa (8). At least for U6, this is a 
scenario that the ancient mitogenomes from 
Taforalt (and elsewhere) now confirm. 


MIGRATIONS OF THE FIRST AMERICANS 

The first ancient human genome sequence 
was from a ~4000-year-old Paleo-Eskimo 
from Greenland (9). In 2015, Raghavan et 
al. reported 23 ancient genomes from North 
and South American individuals, dated to 
between 6000 and 200 years ago (10). Com- 
parison with modern and other ancient ge- 
nomes led to the conclusion that modern 
Amerindian and Athabaskan groups derive 
from a single source population that began 
to diversify ~13,000 years ago, when it split 
into two branches: a northern one now re- 
stricted to North America (Athapaskans 
and northern Amerindians) and a southern 
one that is now dispersed across North and 
South America. The latter also includes the 
genome of Anzick-1, a Clovis-culture male 
infant from Montana dated at ~12,600 years 
ago. The split was considered to have most 
likely happened in lower-latitude North 
America (south of the Cordilleran and Lau- 
rentide ice sheets) rather than in eastern 
Beringia (Alaska). 

Recently, the same group reported the ge- 
nome from another infant, in this case a fe- 
male from Upward Sun River, Alaska, dated 
at 11,500 years ago (5). Her genome be- 
longed to a Beringian-specific ancient popu- 
lation that had diverged from the ancestors 
of Amerindians and Athapaskans between 
22,000 and 18,100 years ago, showing that 
the ancestors of Native Americans were 
already differentiating during their stay in 
Beringia (“Beringian standstill”). More im- 
portantly, this infant Alaskan genome sug- 
gests that the split between the southern 
and northern branches occurred 17,500 to 
14,600 years ago, rather than ~13,000 years
ago as previously proposed. This date range
is much more realistic in light of the age
(≥14,500 years) of Monte Verde in southern
Chile, the oldest known archaeological site
in South America.

Scheib et al. have now sequenced an even
larger number of genomes (91, including
mitogenomes) from ancient Native American
remains; only 27 of these remains were
radiocarbon-dated, with ages ranging
between 4800 and 200 years ago (3). They
were mainly selected from the Channel
Islands of California and from Southwestern
Ontario. Both regions show signs of human
occupation as early as 13,000 years ago,
were never reached by the much more
recent ancient Neo- and Paleo-Eskimo
migrations, and were expected to provide a good
representation of the southern and
northern Native American branches.

Scheib et al. identified two distinct basal
ancestries, ANC-A and ANC-B, which
differentiated before the split between the
northern and southern Native American
branches. ANC-A is largely associated
with the southern branch (and Anzick-1),
whereas ANC-B reaches its maximum in
the northern ancient Ontarians and
modern Algonquians. However, both
components are present and admixed in the
ancient Californian populations as well as
in present-day populations from Central
and South America, including the Chilote
and Huilliche, who show a very high ANC-B
frequency despite living in Chile, not far
from the Monte Verde site.

To explain these findings, the authors
propose different models, which all assume an
initial split between ANC-A and ANC-B south
of the Cordilleran and Laurentide ice sheets
(see the figure) and different possible
merging points between the two derived groups
either in North America (once or multiple
times) or along the way to South America.
However, they could not have begun to merge
until a few thousand years after the split,
taking into account that the differentiation of
ANC-A and ANC-B was not a sudden process.
Alternatively, the split may have occurred
north of the ice sheets in Beringia, but in this
case, ANC-A and ANC-B would have
theoretically been susceptible to differential gene
flow from other human groups in Beringia.
Scheib et al. reject this possibility because the
two ancestries do not differ significantly in
their affinity to modern non-American
populations, including Asians.

In their models, Scheib et al. do not
exploit the mitogenome information that they
obtained. For instance, the ancient Ontarians
harbored mtDNA haplogroups X2a and C4c,
which are currently found at high frequency
among the Algonquian-speaking
populations; they could thus be distinctive
mitochondrial markers of ANC-B. In this view, the
detection of an early offshoot of C4c in
modern Colombians (11) is in full agreement with
the scenario that ANC-B contributed to the
shaping of South American genomes.

Given an upper boundary of 17,500 years
ago for the split time between the southern
and northern branches (5) and the
requirement of a few thousand years of
differentiation after the split (3), it may be time
to reconsider an initial differentiation in
Beringia. A reconsideration of the time
and the place of the population split (see
the figure) would also be compatible with
the most recent archaeological findings
in South America (12). However, a split in
Beringia brings back the scenario of a dual
entry into the American continent and thus
the possibility that the two ancestries
entered the Americas either at the same time
or at different times, following the same or
different routes (13-15).

Considering the rapid progression in the
production of ancient genomes, their
densities on geographical maps are expected
to increase very rapidly. There is no doubt
that these data will allow many long-lasting
questions to be addressed, but it is likely
that many others will arise.

Genome clues to ancient migrations

Three studies (2-4) illustrate the power of archaeogenomics to elucidate how modern humans moved and mixed as they populated the globe.

[Figure: three map panels, drawn on a current map or a Last Glacial Maximum map.]

~1.1 thousand years ago. Iceland: The genomes of the earliest settlers more closely
resemble those of modern inhabitants of Scandinavia and
the British-Irish Isles than those of modern Icelanders.

>20 thousand years ago. North Africa (Morocco): The ancestors of Late Stone Age people
in Morocco came from the Near East and from
sub-Saharan Africa, but not from Europe.

17.5 to 15.0 thousand years ago. Americas: Ancient Native American genomes suggest
different models of how populations may have split and
merged during the initial settlement of the Americas.


REFERENCES 


1. A. C. Renfrew, K. V. Boyle, Eds., Archaeogenetics: DNA and
the population prehistory of Europe (McDonald Institute
for Archaeological Research, 2000).

2. S. S. Ebenesersdottir et al., Science 360, 1028 (2018).

3. C. L. Scheib et al., Science 360, 1024 (2018).

4. M. van de Loosdrecht et al., Science 360, 548 (2018).

5. J. V. Moreno-Mayar et al., Nature 553, 203 (2018).

6. P. Soares et al., Am. J. Hum. Genet. 84, 740 (2009).

7. C. Posth et al., Curr. Biol. 26, 827 (2016).

8. A. Olivieri et al., Science 314, 1767 (2006).

9. M. Rasmussen et al., Nature 463, 757 (2010).

10. M. Raghavan et al., Science 349, eaab3884 (2015).

11. B. H. Kashani et al., Am. J. Phys. Anthropol. 147, 35 (2012).

12. T. J. Braje et al., Science 358, 592 (2017).

13. U. A. Perego et al., Curr. Biol. 19, 1 (2009).

14. A. Achilli et al., Proc. Natl. Acad. Sci. U.S.A. 110, 14308
(2013).

15. B. A. Potter et al., Science 359, 1224 (2018).


10.1126/science.aat7257 


1 JUNE 2018 » VOL 360 ISSUE 6392 965 



INSIGHTS | PERSPECTIVES 


BIOMATERIALS 


Neuromorphic circuits impart a sense of touch 


Flexible organic transistors transmit force signals encoded for direct input to human neurons 


By Chiara Bartolozzi 


The sense of touch is the ability to
perceive consistency, texture, and shape
of objects that we manipulate, and
the forces we exchange with them.
Touch is a source of information that
we effortlessly decode to smoothly
and naturally grasp and manipulate ob- 
jects, maintain our posture while walking, 
or avoid stumbling into obstacles, allowing 
us to plan, adapt, and correct actions in an 
ever-changing external world. As such, arti- 
ficial devices, such as robots or prostheses, 
that aim to accomplish similar tasks must 
possess artificial tactile-sensing systems. 
On page 998 of this issue, Kim et al. (1)
report on a “neuromorphic” tactile sensory
system based on organic, flexible, electronic 
circuits that can measure the force applied 
on the sensing regions. The encoding of 
the signal is similar to that used by human 
nerves that are sensitive to tactile stimuli 
(mechanoreceptors), so the device outputs 
can substitute for them and communicate 
with other nerves (e.g., residual nerve fibers 
of amputees or motor neurons). The pro- 
posed system exploits organic electronics 
that allow for three-dimensional printing 
of flexible structures that conform to large 
curved surfaces, as required for placing sen- 
sors on robots (2) and prostheses. 

The use of neuromorphic circuits—elec- 
tronic circuits that work under principles 
similar to those of our nervous system— 
leads to better efficiency, compactness, and 
fault tolerance. The resulting devices can 
interpret the often-ambiguous and noisy 
sensory signals to gather information about 
their surroundings that is useful for provid- 
ing sensory feedback to humans or robots. 
From those data, relevant information can 
be extracted to plan appropriate actions 
at an affordable computational cost and 
power budget (3). 

The system devised by Kim et al. is based 
on the encoding of tactile stimuli with 
trains of digital pulses (spikes or action 
potentials) like those exchanged in neural 
systems. In the brain, diverse features of 
the spike train, including the spike rate and 
the precise temporal pattern of the spikes 
(4), encode stimulus properties and also 
the behavioral choices (5) associated with 
the stimulus itself. The neural code is at 
the same time rich and efficient, being an 
optimal choice to design artificial systems 
for sensing and perception. In this respect, 
Kim et al. add to the current state of the 
art by using a neural representation that ef- 
ficiently encodes relevant stimulus charac- 
teristics, such as direction of motion, shape, 
texture, slip, and softness (6-8). 
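
As a rough illustration of rate coding, the sketch below converts a sampled force signal into spike times with a simple integrate-and-fire rule, so that a stronger press yields a higher spike rate. It is not the circuit built by Kim et al.; the function name, the gain, the threshold, and the sample force values are all assumptions chosen for illustration.

```python
def encode_force_as_spikes(force_samples, dt, gain=100.0, threshold=1.0):
    """Turn a sampled force signal into a list of spike times (seconds).

    A hypothetical integrate-and-fire encoder: the force is accumulated
    over time, and each time the accumulated value reaches the threshold
    a spike is emitted and the threshold amount is subtracted.
    """
    spike_times = []
    potential = 0.0
    for i, force in enumerate(force_samples):
        # Negative readings are treated as no contact.
        potential += gain * max(force, 0.0) * dt
        if potential >= threshold:
            spike_times.append(i * dt)
            potential -= threshold
    return spike_times


dt = 0.001  # 1 ms sampling step (assumed)
weak_press = encode_force_as_spikes([0.1] * 1000, dt)
strong_press = encode_force_as_spikes([0.5] * 1000, dt)
no_contact = encode_force_as_spikes([0.0] * 1000, dt)
print(len(weak_press), len(strong_press), len(no_contact))
```

With a constant force, the spike rate in this sketch is approximately gain × force / threshold, so only the rate carries information. Encoding features such as texture or direction of motion would also require the temporal pattern of the spikes, which this toy encoder does not model.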

This contribution confirms the idea that 
a neuromorphic approach to the
implementation of artificial tactile systems can be
relevant in robotics (9), where tactile
information can improve the interaction of the
robot with objects. In this scenario, objects
can be better recognized because touch
complements the information gathered
from vision about the shape of occluded
or badly illuminated regions of the object,
such as its texture or hardness (10). Tactile
information also allows objects to be
better manipulated, for example, by
exploiting contact and slip detection to maintain
a stable but gentle grasp of fragile or soft
objects (see the photo).

The iCub robot gently holds a soft toy. Tactile information
is used for grasping, in-hand manipulation,
shape exploration for object classification, safety,
learning a body schema, obstacle avoidance,
and for perceiving and counteracting perturbations
for balancing.

This approach applies not only to au- 
tonomous robots, such as industrial ma- 
nipulators and robot companions, that 
cooperate with humans in unstructured en- 
vironments, but also to remotely operated 
robots. A notable example is surgical robots, 
where the surgeon must finely calibrate the 
force applied to the tip of the surgical ro- 
bot. In this context, haptic feedback about 
tissue softness or hardness and texture can 
greatly help the surgeon improve the preci- 
sion of the intervention and ease of use of 
the instrument. 

Most important, natural haptic feed- 
back can minimize the continuous visual 
attention required to successfully perform 
manipulation in upper-limb prosthetics. 
Patients wearing functional prosthetics that 
can be actuated to perform daily living ac- 
tivities need to constantly visually monitor 
the execution of their actions to success- 
fully accomplish a task. An intuition of the 
required effort can be gathered when put- 
ting one’s hands in an ice bucket for some 
minutes; the low temperature impairs our 
tactile sensors, and we have difficulty even 
just maintaining a stable grasp. 

Information about shape, softness, slip, 
and contact forces also greatly improves 
the usability of upper-limb prosthetics in 
fine manipulation (11). The few devices 
equipped with tactile sensing focus on this 
aspect by adding sensors at the fingertips. 
The advantage of the technology devised by 
Kim et al. is the possibility of covering at a 
reasonable cost larger surfaces, such as fin- 
gers, palms, and the rest of the prosthetic 
device. Safety is enhanced when sensing 
contacts inform the wearer that the limb is 
encountering obstacles. The acceptability 
of the artificial hand by the wearer is also 
improved because the limb is perceived as 
part of the body, rather than as an external 
device. Lower-limb prostheses can take ad- 
vantage of the same technology, which can 
also provide feedback about the distribu- 
tion of the forces at the foot while walking. 

The contribution of Kim et al. fits in the 
general effort of deploying neuromorphic 
technology in neuroprosthetics (12-14). The 
spikes emitted by the proposed artificial 
tactile sensor (neural afferent) were directly 
used to stimulate a biological neuron that 
elicits muscle contraction. This hybrid
bioelectronic proof of concept paves the way
to the use of this technology to restore 
muscular activation, such as in foot drop 
(abnormal gait caused by nerve damage) or 
bladder dysfunction. 

Important challenges lie ahead to trans- 
late this result to a breakthrough in the 
prosthetic (and robotic) domain. So far, 
each application has focused on a single 
dimension of the tactile stimulus, such 
as slip detection, texture, or shape, or 
direction-of-motion classification. The 
next generation of tactile sensory systems 
must integrate all of these to offer richer 
sensory information. The contribution of 
computational neuroscientists will be es- 
sential to this field in order to understand 
which feature of the neural code is related 
to each stimulus property and which neu- 
ral architecture can support the encoding 
and decoding of such information. 

In the proposed implementations, only 
one of the four types of mechanoreceptors 
of human glabrous skin has been emulated, 
and a single type of transduction is avail- 
able in each system. These single types of 
transduction relate to both the physical 
transduction properties (capacitive, resis- 
tive, piezoelectric, optical, etc.) and the 
spiking behavior (e.g., fast with respect to 
slow-adapting receptors). The challenge 
will be to develop a hybrid system with 
transduction sites of different bandwidth, 
dynamic range, resolution, sensitivity, and 
mechanical characteristics, with the help of 
materials science, and to implement neu- 
ral models that can take into account the 
different adaptation characteristics of skin 
receptors. The ultimate system will include 
temperature sensors and proprioception. 
DEVELOPMENT 


A new view of embryo 
development and regeneration 


Transcriptome atlases of vertebrate embryos and 
regeneration enrich our understanding of development 


By Richard M. Harland 


Among the major milestones in
developmental biology are those that
authoritatively describe how cells
differentiate during development.
Conklin (1) described the mosaic
development of marine invertebrates
(sea squirt), in which specific and recog- 
nizable cells of the dividing embryo consis- 
tently produce specific tissues of the larva 
and naturally pigmented myoplasm seg- 
regates to the muscle lineage of the larva, 
supporting the idea of localized cytoplasmic 
determinants that influence cell fate. Later, 
lineage tracer dyes were used to track the 
fates of different regions in amphibian em- 
bryos (2). More recently, careful observation 
produced the triumph of a complete cell 
lineage map of the nematode worm (Cae- 
norhabditis elegans) embryo (3). On pages 
980, 979, and 981 of this issue, Briggs et al. 
(4), Farrell et al. (5), and Wagner et al. (6), 
respectively, as well as Fincher et al. (7) and 
Plass et al. (8), mark another innovation: a 
description of gene expression in all of the 
cells of fish and frog embryos (4-6) and in 
the adult regenerating planarian (flatworm) 
(7, 8). These papers describe the transcrip- 
tional information that forms the blue- 
prints of development and regeneration in 
these organisms. 

Although we now understand many 
mechanisms underlying developmental 
decisions, such as the inheritance of local- 
ized RNAs or stereotypical arrangement 
of signaling and responding cells, there is 
still a great deal that we do not know; for 
example, experiments have sampled many 
important regulators of cell fate, but the 
complete set of transcripts that regulate cell 
fate decisions has been unattainable. 

These studies demonstrate the power of 
single-cell RNA sequencing (scRNA-seq) 
in producing detailed and comprehensive 
plots of gene expression correlations with 
emerging cell fates. This technology has 
adapted previous single-cell microfluidics; 
the embryo is dissociated into single cells, 
each of which merges with a drop contain- 
ing barcoded primers on a bead so that 
mRNA can be amplified for high-through- 
put sequencing of cDNA. For embryos, the 
dissociation and gentle handling of the large 
and delicate cells were a crucial advance 
in permitting the sequencing of enough 
cells to reconstruct gene expression in all 
cells, even in rare cells such as germ cells. 
The barcodes identify transcripts from the 
single cells they were extracted from, and 
similarities and differences in transcript 
composition sort the cells into clusters, rep- 
resenting states of differentiation. 
Technically, it is a challenge to capture all 
the different transcripts from a single cell, 
so inevitably, some important transcripts 
can be missed. This technical limitation 
is further complicated by biological varia- 
tion in gene expression between similar 
cells. Although the recovered population of 
transcripts is not comprehensive, it is good 
enough to recognize cells that are similar, 
or different, and cluster them accordingly. 
By analyzing cells from different embry- 
onic stages, trajectories of differentiation 
can be reconstructed over time, because 
cells follow a progressive pathway of tran- 
scriptional changes as they differentiate. As 
cells become different, these paths branch 
to provide a molecular picture of emerging 
differentiation, and the transcripts that de- 
fine the differences can be identified. Sat- 
isfyingly, the differences include ones that 
have been identified and characterized by 
previous methods. Reassuringly, the vari- 
ous computational approaches to cluster- 
ing cell types provide consistent outcomes. 
Adding human annotation of the branches 
of differentiation, on the basis of known 
markers, superimposes them onto developmental 
anatomy, providing a reconstruction 
of transcription in early embryos at the 
cellular level. This manual annotation can 
account for the loss of spatial information 
when the organism was dissociated and 
is complemented by a semiautomated ap- 
proach (5). 
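
The barcode-and-cluster logic described above can be illustrated with a toy example. This is a minimal sketch, not the pipeline used in the studies: the reads, the barcodes, the use of cosine similarity, the greedy grouping rule, and the 0.5 threshold are all assumptions chosen for brevity, and the gene names are placeholders.

```python
from collections import Counter, defaultdict
from math import sqrt

# Hypothetical toy reads: (cell_barcode, gene) pairs. Real data would come
# from a sequencing pipeline; these values are invented for illustration.
reads = [
    ("AAAC", "sox2"), ("AAAC", "sox2"), ("AAAC", "krt4"),
    ("CCGT", "sox2"), ("CCGT", "sox2"), ("CCGT", "sox2"),
    ("GGTA", "tbxta"), ("GGTA", "tbxta"), ("GGTA", "noto"),
    ("TTAG", "tbxta"), ("TTAG", "noto"), ("TTAG", "noto"),
]

def count_by_cell(reads):
    """Use the barcode to assign each transcript back to its cell of origin."""
    counts = defaultdict(Counter)
    for barcode, gene in reads:
        counts[barcode][gene] += 1
    return dict(counts)

def cosine(a, b):
    """Cosine similarity between two sparse gene-count profiles."""
    dot = sum(a[g] * b[g] for g in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)

def cluster(profiles, threshold=0.5):
    """Greedy grouping (an assumed stand-in for real clustering methods):
    a cell joins the first cluster whose founding cell it resembles,
    otherwise it founds a new cluster."""
    clusters = []
    for barcode in sorted(profiles):
        for members in clusters:
            if cosine(profiles[barcode], profiles[members[0]]) >= threshold:
                members.append(barcode)
                break
        else:
            clusters.append([barcode])
    return clusters

profiles = count_by_cell(reads)
clusters = cluster(profiles)
```

Even though each toy cell captures only a few transcripts, cells with overlapping profiles end up in the same group, which is the sense in which an incomplete transcript sample can still be "good enough" for clustering.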

The studies have developed impressive 
visualization tools that allow us to see how 
cells start to differentiate, and, after em- 
barking on specific lineages, how cell fate is 
reinforced and specialized over time. Over- 
all, the developing animals show an initial 
trunk of unresolved cell types, from which 
large branches and eventually “fine twigs” 
of specialization emerge. As the branches 
spread, the expression of known mRNAs 
confirms that cells are be- 
coming restricted to specific 
fates. From these datasets, 
we can infer the progress 
of individual cells over time 
and some of the variation in 
routes that cells take toward 
the differentiated state. Al- 
though most branches be- 
come progressively divided, 
an exception occurs where 
two divergent branches 
from neural crest and pha- 
ryngeal arches coalesce 
again, consistent with the 
idea that neural crest cells 
adopt a fate appropriate to 
their local environment (9). 

Importantly, the scRNA- 
seq data capture cells at 
branch points, a previously 
inaccessible cell state in 
which fate choices are made. 
Resolution of the decision 
shows which transcription 
factors are enriched and 
may determine cell fate. In 
most cases, it appears that 
decisions are binary, al- 
though there are examples where there is 
a choice between multiple fates. Specific 
cases document a mixed cell state, for ex- 
ample, in the choice between prechordal 
plate and notochord where a substantial 
fraction of cells appear bipotential, even 
after territories have been defined. Such 
cells must resolve their expression quickly 
to adopt the fate of their local community, 
or presumably be eliminated. 

Apart from the amount of information 
to be digested from these transcriptome 
atlases, what other nuggets have emerged? 
The information enriches what we already 
know and additionally identifies transcripts 
that had not previously been associated 
with pathways of differentiation. The si- 
multaneous expression of determinants 
for multiple lineages in individual cells has 
been documented for a few cell types (10) 
but is now documented for dozens of cell 
fate decisions. Examination of the tran- 
scription factors that are coexpressed at de- 
cision points will suggest new experiments 
on the cooperation between transcriptional 
regulators. Cells that appear to be superfi- 
cially similar, coming from the same region 
of the embryo, may express nonoverlapping 
transcripts, indicating a fundamental het- 
erogeneity in cell identity, something that 
has been difficult to establish by whole- 
tissue analysis. 

The gene expression landscape of a developing zebrafish embryo is visualized. Dots 
represent snapshots of individual cell states measured by scRNA-seq during the first day 
of development. Cells of the early embryo are at the center, with differentiating cells radiating 
along branches to form neural (red), epidermal (blue), and mesendodermal (green) tissues. 

The fish and frog have been extensively 
studied, so a lot of the information in these 
studies provides fine detail, but in an animal 
like the regenerating planarian, which 
has only recently become a standard model 
of regeneration and development, there 
are new cell types and developmental tra- 
jectories to be discovered. The analyses of 
the regenerating planarian highlight how 
scRNA-seq provides a fast start to establish- 
ing the molecular anatomy of the organism, 
including rare cell types, which will acceler- 
ate research in this and other new models. 
The studies of Fincher et al. and Plass et 
al. on the regenerating planarian also il- 
lustrate, for a whole organism, what had 
been achieved in previous work on tissue 
renewal during homeostasis, where cells 
asynchronously differentiate from a vari- 
ety of stem cell precursors. The clustering 
of progressive changes in transcripts allows 
the trajectories of differentiation from stem 
cells to be reconstructed in “pseudotime.” In 
these cases, the trajectories start from 
neoblasts (stem cells) and trace the development 
of the entire organism. 
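
The idea of ordering asynchronously differentiating cells in "pseudotime" can be shown with a small example. This is a minimal sketch under stated assumptions, not the method of Fincher et al. or Plass et al.: the two-marker expression profiles, the choice of root cell, and the nearest-neighbor chaining rule are all hypothetical simplifications.

```python
from math import dist

# Hypothetical expression profiles: (stem marker, differentiation marker).
# The values are invented; real profiles span thousands of genes.
cells = {
    "cell_c": (0.2, 0.9),
    "cell_a": (1.0, 0.0),  # assumed root: the most stem-like state
    "cell_d": (0.0, 1.0),
    "cell_b": (0.7, 0.4),
}

def pseudotime_order(cells, root):
    """Order cells by repeatedly stepping to the nearest unvisited profile,
    so that neighbors in the ordering have similar transcript composition."""
    order = [root]
    remaining = set(cells) - {root}
    while remaining:
        current = cells[order[-1]]
        # sorted() makes tie-breaking deterministic
        nearest = min(sorted(remaining),
                      key=lambda name: dist(current, cells[name]))
        order.append(nearest)
        remaining.remove(nearest)
    return order

order = pseudotime_order(cells, "cell_a")
```

The cells were all sampled at the same moment, yet the ordering recovers a progression from the stem-like profile to the differentiated one, which is what distinguishes pseudotime from sampling time.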

Where is this scRNA-seq technique 
leading? The studies of zebrafish embryo- 
genesis by Farrell et al. and Wagner et 
al. show that, in principle, scRNA-seq is 
a rapid assay to understand the pheno- 
types of mutants, with loss of Chordin, a 
bone morphogenetic protein antagonist, 
or loss of Nodal signaling. These analyses 
confirm previous work and add the observation 
that altering signaling does 
not create any new cell types 
but instead changes the proportions of cell 
types that would develop 
in the wild-type animal. 
This technique 
is currently prohibitively 
expensive for routine use, 
but, just as genotyping 
by DNA sequencing has 
largely replaced laborious 
genetic mapping, so too 
might scRNA-seq provide 
a more cost-effective ap- 
proach to understanding 
complex phenotypes than 
multiple rounds of hypoth- 
eses and experiments. The 
comparison of transcrip- 
tomes also highlights simi- 
larities and differences in 
distantly related species 
such as bony fish (zebraf- 
ish) and frogs that will lead 
to analysis of how different 
cell types may have evolved 
from common ancestors. 

Just as the emergence of 
genomic information forced 
communities to join together 
to annotate genomes, there 
is tremendous benefit to annotating 
developmental processes and searching 
for new insights and discoveries as a commu- 
nity. Now that these data are available, the 
door to imagining new uses is wide open. 

GRADUATE EDUCATION 


Student-centered, modernized 
graduate STEM education 


Rebalance incentives to promote culture change 


By Alan I. Leshner 


The U.S. graduate education system for 
science, technology, engineering, and 
mathematics (STEM) is widely regarded 
as among the best in the world. 
However, evolution in the structure 
and functioning of the U.S. graduate 
education system is lagging ongoing 
changes in the broader scientific enterprise, 
in the requirements of employers, in the de- 
mographics of the student population, and, 
particularly, in their career ambitions. In 
response to these disconnects, the U.S. Na- 
tional Academies of Sciences, Engineering, 
and Medicine (NASEM) recently released a 
report (1) that lays out a vision of an ideal 
modern graduate education in any STEM 
field and a comprehensive plan to achieve 
that vision. The report emphasizes core com- 
petencies that all students should acquire, 
a rebalancing of incentives to better reward 
faculty teaching and mentoring of students, 
increased empowerment of graduate stu- 
dents, and the need for the system to better 
monitor and adapt to changing conditions 
over time. Although these issues have been 
raised in the past, and some institutions have 
taken positive steps, graduate students are 
still too often seen as being primarily sources 
of inexpensive skilled labor for teaching un- 
dergraduates and for performing research. 
Graduate students should demand the kind 
of education outlined in the report. 

There have been many other reports on 
graduate education produced by various 
groups since the National Academies’ last 
comprehensive report on this subject in 
1995 (2), and some of their recommenda- 
tions have been similar to those in the new 
report. Yet the system still lags modern 
trends and needs. For example, most stu- 
dents now pursue nonacademic careers, yet 
many institutions train them, basically, in 
the same way that they have for 100 years, 
to become academic researchers (3). Nu- 
merous surveys of employers and students 
suggest that many graduate programs are 
not preparing students adequately to suc- 
ceed in a variety of careers; employers are 
finding many recent graduates overly naive 
about research and development activities 
in government and industry. 

What is special about this latest report that 
makes one expect anything different will hap- 
pen? First, the report lays out a systems ap- 
proach to modernizing graduate education, 
emphasizing the need for coordinated action 
on the part of every stakeholder (including 
students, faculty, universities, government 
agencies and other funders, and employ- 
ers). Second, the report lays out a vision of 
an ideal 21st-century graduate STEM education, 
emphasizing the need for the system to 
become much more student-centered. Importantly, 
the report recognizes that achieving 
its goals will entail substantial cultural 
change throughout the system and identifies 
key leverage points, particularly the incentive 
system that drives much of academic culture. 

High-quality advising and mentoring are essential 
for student success in graduate school. 

A 21ST-CENTURY IDEAL 

The model proposed in this report builds on 
the strengths of the current approach that 
have made U.S. graduate education so effec- 
tive and attractive to students throughout 
the world but offers ways to make it more 
relevant to the needs of a very diverse stu- 
dent population, to prospective employers, 
and to broader society. Central to the ideal 
is for every student to achieve a set of core 
competencies that are the essence of being a 
master’s or doctoral level STEM professional. 
Achieving those competencies would ensure 
that students have acquired broad technical 
literacy as well as deep specialization in an 
area of interest. Students would acquire both 
an understanding of and ability to apply the 
ethics and norms of the scientific enterprise 
and to communicate findings to colleagues 
and the public. 

In the ideal, all prospective graduate stu- 
dents would be provided fully transparent, 
easily accessible data about career path- 
ways and successes of previous students, so 
they can make more informed choices about 
whether and where to go to graduate school. 
Moreover, students from all backgrounds 
would be encouraged and better enabled to 
participate fully and achieve their greatest 
potential in STEM fields in diverse, equitable, 
and inclusive environments. Students would 
be encouraged and given time and resources 
to explore a broad range of career options, 
through courses, seminars, internships, or 
other real-life experiences. Institutions would 
provide faculty with training and resources 
to improve their skills as advisers and men- 
tors. Departments and programs would help 
students identify advisers and mentors who 
can best support their academic and career 
development, and students would be encour- 
aged to sample multiple research groups and 
faculty styles before settling on a group in 
which to focus. They would have experiences 
working across disciplines and in teams. And 
students would be provided opportunities to 
communicate their work to both professional 
and public audiences and to understand the 
broader impacts of their research. 

Some universities now offer some or all of 
these kinds of experiences, and some disci- 
plines have moved further in this direction 
than others. But this modern approach is far 
from universal. It should become the norm. 
If we do not make needed changes, we risk 
weakening the ability of our programs to at- 
tract the best and brightest students both do- 
mestically and internationally. 


EFFECTING CHANGE 

The proposed changes are not trivial and 
will require substantial attitudinal, behav- 
ioral, and cultural adjustments throughout 
the graduate education system. The entire 
system must become more student-centric. 
It must increase the value it places on high- 
quality teaching, mentoring, and advising 
students. The mindset that often stigma- 
tizes students with nonacademic research 
career goals needs to adjust to the reality 
of the 21st century, where most students do 
not pursue academic careers but will still 
contribute importantly in other sectors. The 
report lays out an action plan that includes 
steps to be taken by every stakeholder; suc- 
cess in modernizing the system requires 
that everyone does his or her part. 

Central to the success of this plan will be 
a readjustment of the incentives that drive 
so many attitudes and behaviors through- 
out the graduate education system. Those 
incentives currently are heavily weighted to- 
ward rewarding research productivity in the 
form of numbers of grants received, papers 
published, and future academic researchers 
produced. Although those metrics should 
remain important for some purposes, more 
weight needs to be given to providing qual- 
ity graduate education and mentoring if the 
other recommended changes are to happen. 

An essential step in rebalancing the aca- 
demic incentive system and, in turn, faculty 
behavior will be changes in federal, state, and 
private granting-agency funding policies, at 
a minimum in those cases where graduate 
education is being supported either directly 
or indirectly. As currently structured, fund- 
ing criteria for training grants or for support- 
ing graduate assistants on research grants 
typically emphasize traditional measures of 
research productivity and the numbers of 
academic researchers being produced, but 
place much less emphasis on the quality of 
teaching and mentorship that students re- 
ceive. Those criteria need to be rebalanced to 
give more weight to quality educational expe- 
riences that will benefit students no matter 
their career aspirations. 

The fact that some government agencies 
are already using funding policies to help 
shape graduate education is cause for opti- 
mism that this approach can be effective. For 
example, funding of training by all seven of 
the United Kingdom’s research councils (4) 
includes policies to stimulate changes in the 
graduate education paradigm, many in directions 
consistent with the NASEM report’s 
recommendations. For example, the new U.K. 
funding criteria include quality of both men- 
toring and career advising. Similar require- 
ments are in place for institutional graduate 
training grants issued by the U.S. National In- 
stitutes of Health’s National Institute of Gen- 
eral Medical Sciences (5). Both are relatively 
new policy initiatives and are somewhat con- 
troversial, because their implementation will 
inevitably incur some financial and human 
resource costs. It will take some time before 
it is clear whether they are effective in lead- 
ing cultural change. Other examples include 
certain grants from the U.S. National Science 
Foundation (NSF) that require mentoring 
plans for postdoctoral researchers (6), which 
might provide models for similar policies at 
the graduate student level. 

The U.S. Congress can play an important 
role by advocating for these kinds of funding 
policy changes in both its authorizing and 
appropriation processes for federal research 
and development agencies. Those kinds of 
congressional actions (7) helped seed cultural 
change concerning the adoption and use of 
NSF’s “broader impacts” project review cri- 
terion. Adding that criterion was influential 
in ensuring that NSF grantees more seriously 
considered the consequences of their projects 
beyond the narrow limits of their work (8). 

Once funding criteria are rebalanced, it will 
be much more likely that there is a similar re- 
balancing of incentives in other components 
of the system, such as academic promotion 
and tenure policies. And when those incen- 
tives are adjusted to give greater weight to 
quality graduate education and mentoring, 
it will be much easier to make many of the 
other changes recommended in the report. 

But funding policy changes alone cannot 
produce all the changes needed in the grad- 
uate education paradigm. To achieve its vi- 
sion, the NASEM report specifies a detailed 
action plan, with explicit steps to be taken 
by each stakeholder. For example, although 
the details of implementation will vary ac- 
cording to local contexts, higher education 
institutions, their graduate schools, depart- 
ments, and faculty will have to ensure that 
they provide a truly inclusive and equita- 
ble environment to attract students from 
diverse backgrounds and enable them to 
achieve their full potential. They will have 
to develop and provide training opportuni- 
ties for new faculty and refresher courses 
for established faculty in mentoring. They 
will have to work actively to reduce the 
stigma that students frequently report they 
experience if they are not interested in pur- 
suing academic careers. 

Prospective industry, nonprofit, govern- 
ment, and other employers should provide 
internships and course offerings to enable 
graduate students to become familiar with 
and facilitate their transition into nonaca- 
demic careers. 


COSTS, BENEFITS, AND CONCLUSIONS 
The changes recommended in the NASEM 
report span the entire U.S. graduate educa- 
tion system. The recommendations will not 
be universally popular nor necessarily easy 
to implement. They will require courage, 
leadership, commitment, and persistence. 
In addition to considerable policy changes 
on the part of the institutions that fund aca- 
demia, success in modernizing the system 
will require changes in the way faculty and 
academic administrators view their roles 
and will alter to some degree the way they 
spend their time. Fortunately, many univer- 
sities are already moving in these directions, 
and some academic leadership organiza- 
tions have been working to tackle many of 
the same issues with the same goals and 
similar recommendations in mind. Their 
initiatives, one hopes, will help lead the way 
toward the needed system changes. 
Modernizing graduate STEM education 
will inevitably incur some additional costs, 
in both financial and human resource terms. 
Those costs have at times been used as an 
excuse not to make the needed changes, 
many of which were recommended in earlier 
studies of graduate education but not imple- 
mented across the system. But such invest- 
ments will likely pay off in the long term in 
more satisfied students and in a better-edu- 
cated, modern workforce in the expanding 
array of sectors of society where STEM professionals 
work. 

The accessible predator 


Bloodthirsty murderers no more, captive killer whales 
helped to transform the species’s reputation 


By Robin W. Baird 


Killer whales, also known as orcas, are 
idolized, loved, and even revered. 
Such sentiments, however, have not 
always been held toward this species, 
as historian Jason Colby reveals in 
his new book, Orca. 

From the 1940s into the 1960s, 
killer whales were often depicted 
as a menace to the rest of the 
ocean’s inhabitants, as well as 
to humans. The belief that they 
were a competitor for fish, for 
example, led local fishermen and 
U.S. soldiers with machine guns 
to kill hundreds of killer whales 
off the coast of Iceland in a single 
expedition in 1954. 

As recently as 1983, fisher- 
men in British Columbia were 
shooting killer whales. By that 
time, however, researchers and 
whale-watching operators in the area were 
horrified by such incidents, as the whales 
under attack were well known to both com- 
munities. This change in attitudes, Colby 
convincingly argues, largely came about be- 
cause killer whales were brought into captivity 
starting in the mid-1960s. 

Ted Griffin, an entrepreneur who established 
the Seattle Marine Aquarium, would 
become the first person to swim and per- 
form with a captive killer whale. Griffin 
purchased the killer whale Namu, named 
after his capture location in British Colum- 
bia, and displayed the whale on the Seattle 
waterfront beginning in 1965. 
Griffin’s account of “making 
friends” with Namu, published 
in National Geographic in 1966, 
helped rapidly change human at- 
titudes toward killer whales. 

Colby notes that over a 5-year 
period, about 20 million people 
viewed killer whales captured by 
Griffin. “To be sure, many visitors 
came away from such exhibits 
with misgivings about captivity, 
but most left more likely to view 
cetaceans as individuals and to 
care about the fate of whales in 
the wild.” 

Colby does an excellent job of framing 
these events within the larger environmen- 
tal movement of the time, as well as placing 
them within the context of the nationalism 
that was spreading on both sides of the U.S.- 
Canadian border at the time. (Frustrated by 
the sale of a Canadian-caught killer whale 
to an American marine park in 1965, the 
manager of Victoria’s Pacific Undersea Gar- 
dens lamented, “It seems that every time we 
get something good ... it gets funneled off 
to the U.S.”) 
We aren't inclined to protect that with which we have 
no experience, observed David Attenborough. 


Perhaps more than any other individual, 
Griffin’s early efforts to capture and display 
killer whales led to the burgeoning of live- 
capture fisheries. But the methods employed 
to capture live killer whales, which often oc- 
curred in plain sight of local residents, could 
be violent and disturbing. The use of explo- 
sive “seal bombs” (loud noisemakers nor- 
mally used to drive seals away from fishing 
grounds), the practice of chasing whales into 
narrow inlets, and the inevitable deaths (and 
subsequent cover-ups) of some of the whales 
involved resulted in a tremendous backlash. 
Within a span of 7 years, Griffin went from 
being “fêted by politicians, courted by Hollywood, 
and consulted by the Pentagon” to 
being “villainized by journalists, harassed by 
activists, and alienated from the region he 
had helped change.” 

This backlash, along with growing con- 
cern that this poorly understood whale 
population was not being well managed 
in captivity, led to public pressure to enact 
legislation to restrict whale capture and 
killing. “Only the Vietnam War generated 
more protest letters to the White House in 
the early 1970s than did whaling,” notes 
historian Kurkpatrick Dorsey. 

In 1971, the Canadian marine biologist Mi- 
chael Bigg began tracking wild killer whale 
populations (individual whales were identi- 
fied from photographs that revealed distinc- 
tive features—e.g., notches in the dorsal fin 
or color patterns on the skin) and studying 
their social organization and life history. Be- 
fore Bigg’s efforts—with the exception of Clif- 
ford Carl’s attempt to follow the movements 
of a single white killer whale in the 1940s 
and 1950s—killer whales were assumed to 
be indistinguishable from one another, and 
the population in the Pacific Northwest was 
thought to number in the thousands. 

It was soon revealed, however, that most of 
the whales captured or killed during live cap- 
ture’s heyday came from what we now know 
as the fish-eating southern “resident” popu- 
lation, a community numbering just over a 
hundred individuals at the time of the start 
of the capture operations. Live captures re- 
duced the population to about 70 individuals, 
and the captures of young females and juve- 
niles would limit its recovery for many years. 

Today the population is endangered, 
jeopardized not by removal of individuals 
for captivity but by degradation of their en- 
vironment and reduction of their prey base. 
With only ourselves to blame, do we still 
“care enough to save them?” asks Colby in 
the book’s closing pages. 

Probing the microbial 


A wide-ranging text synthesizes what we know 
(and don’t know) about the microbiome 


By Fernando E. Vega 


Angela Douglas is an internationally 
recognized expert on symbiosis, 
with a number of foundational texts 
to her name (1, 2). In her new book, 
Fundamentals of Microbiome Science, 
Douglas synthesizes data from the burgeoning 
field of microbiome science in eight 
highly informative chapters. Topics include 
the origins of the animal microbiome, what 
we know about the microbiome’s interac- 
tions with the immune system, hints at how 
microbes drive animal behavior, and how the 
gut microbiota are involved in gut- 
brain communication. The book 
also clearly delineates the influence 
of the microbiome in determining 
human health and disease. 

The microbiome revolution is 
expanding at breakneck speed and 
moving from “the study of correla- 
tion to causation and mechanism.” 
For example, mice lacking the leptin 
gene, which regulates satiety, con- 
sume more food and become obese. 
When the microbiota of obese mice 
are transferred to lean mice, the 
mice eventually become obese. 

The reciprocal mechanisms in- 
volved in the interactions between 
the immune system and the mi- 
crobiota are just beginning to be 
understood. In mammals, for example, 
the production of immunoglobulin A 
prevents contact of the 
microbiota with the epithelial cells 
of the gut, thus impeding microbe access to 
internal organs. Other innate immune effec- 
tors, including antimicrobial peptides, lec- 
tins, and certain enzymes, complement this 
effect, but many other factors (e.g., age of 
host, sex, and genotype) could also influence 
the outcome. The immune system can thus 
promote, tolerate, or inhibit the composition 
of the microbiota. In contrast, the microbiota 
can “promote or dampen immune system 
function,” and the effects of microbial prod- 
ucts on the regulation of immune effectors 
have been implicated in this process. 

972 1 JUNE 2018 • VOL 360 ISSUE 6392 

The book includes interesting studies 
on how the microbiome might influence 
complex processes, including circadian 
rhythms (e.g., the sleep-wake cycle), and 
mental health. One particularly interest- 
ing example is the possible involvement of 
Bifidobacterium infantis in mental health 
and, specifically, in the reduction of anxiety. 
When germ-free mice were subjected to re- 
straint stress, their plasma titers of corticos- 
terone and adrenocorticotropic hormone 
(stress hormones) increased, compared 
with conventional mice. The significantly 
higher titers were reversed when the germ- 
free mice were given B. infantis. 

The microbiota have also been shown to 
influence animal communication and feed- 
ing behavior. Volatile carboxylic acids, for ex- 
ample, are known to emanate from the anal 
glands of many carnivorous mammals. These 
volatiles are the fermentation products of the 
microbiota that inhabit the glands and “play 
important roles in the social interactions of 
these mammals, including individual and 
group identity, territory marking, and recog- 
nition of mate and offspring.” 

Douglas discusses how signaling mol- 
ecules produced by microbes may represent 
a possible mechanism in the regulation of 
feeding behavior but cautions that the cur- 
rent data are “far from conclusive.” Simi- 
larly, she reveals that, although possible, 
“there is no definitive evidence” showing 
that the microbiota regulate the production 
of feeding regulatory molecules in the host. 
This kind of tempered analysis is one of the 
many strengths of this book. 

In the final chapter, Douglas delves into 
some interesting and provocative topics 
and brings to the forefront the need to re- 
frame our understanding of animals. Here, 
she includes a discussion of climate change 
and its potential effects on microbiota. A 
widely reported consequence of climate 
change is an increased range for animals 
into higher elevations and higher latitudes. 
How, Douglas wonders, will microbiota ac- 
quired in these new environments influence 
animals? This is an area in which 
“data are almost entirely lacking.” 

For many years, I have studied 
the microbiota of coffee plants 
and of the coffee berry borer, a 
small bark beetle considered the 
most important insect pest of 
coffee worldwide. Reading Doug- 
las’s book resulted in a cavalcade 
of new research ideas, an effect I 
have only very rarely experienced 
while reading scientific literature. 
How did the coffee berry borer 
and its microbiota coevolve? How 
does the insect acquire these bac- 
teria and pass them on to its prog- 
eny? How do the microbiota differ 
between insects in cooler versus 
warmer coffee-growing areas? 

Douglas has an encyclopedic 
knowledge of microbial systems, 
their ecology, and their metabolic 
functions with which she explains 
complex physiological mechanisms and 
identifies important research gaps. A sum- 
mary at the end of each chapter succinctly 
recaps the preceding material, and 440 
highly useful (but unobtrusive) references 
direct readers to the primary literature on 
which Douglas has based her claims. Doz- 
ens of figures elaborate on the complex con- 
cepts discussed in the text. 

Overall, reading this book is a most satisfying 
intellectual feast. 

LETTERS 


Edited by Jennifer Sills 


Call to restrict 
neonicotinoids 


Neonicotinoids are the most widely used 
insecticides in the world (1). They are 
applied to a broad range of food, energy, 
and ornamental crops, and used in 
domestic pest control (2). Because they are 
neurotoxins, they are highly toxic to insects 
(2), a group of organisms that contains the 
majority of the described life on Earth, and 
which includes numerous species of vital 
importance to humans such as pollinators 
and predators of pests (3). Neonicotinoids 
have proved to be highly persistent in 
the environment, such that substantial 
residues are commonly found in soils, wild- 
flowers, streams, and lakes (4). One recent 
study found neonicotinoids in 75% of 
honey samples collected from around the 
world (5). Hundreds of independent scien- 
tific studies have been performed to assess 
their impacts on beneficial organisms such 
as bees, aquatic insects, butterflies, and 
predatory beetles (4, 6). 

It is the view of the undersigned scien- 
tists that the balance of evidence strongly 
suggests that these chemicals are harming 
beneficial insects and contributing to the 
current massive loss of global biodiversity. 
As such, there is an immediate need for 
national and international agreements to 
greatly restrict their use, and to prevent 
registration of similarly harmful agro- 
chemicals in the future. On 28 April, the 
European Parliament voted for a complete 
and permanent ban on all outdoor uses of 
the three most commonly used neonic- 
otinoid pesticides (7). With the partial 
exception of the province of Ontario, 
Canada (8), governments elsewhere have
failed to take action. 

Failure to respond urgently to this issue 
risks not only the continued decline in 
abundance and diversity of many ben- 
eficial insects, but also the loss of the 
services they provide and a substantial 
fraction of the biodiversity heritage of 
future generations. 

Dave Goulson and 232 signatories* 
School of Life Sciences, University of Sussex, Falmer, Brighton,
BN1 9QH, UK. Email: d.goulson@sussex.ac.uk
*The full list of signatories is available online. 
U.S. budget targets
fish and wildlife work 


In 1935, embracing the principle that sci- 
ence should serve as the basis of federal 
wildlife policy, the U.S. Geological Survey 
(USGS) established the Cooperative Fish 
and Wildlife Research Unit Program 

(1). The Cooperative Research Units
(CRUs) facilitate research among natural
resource agencies and universities
to inform decisions on how to manage
millions of acres of land nationwide.

[Photo caption: Neonicotinoids threaten aquatic insects, such as this
mayfly, as well as species that rely on them for food.]

The work of CRU scientists has helped 
guide hundreds of natural resource 
management decisions. Most recently, it 
has informed energy exploration on the 
Colorado Plateau and offshore areas of 
Alaska, a decision not to list the Sonoran 
desert tortoise as endangered, strategies 
to manage the Klamath River Basin to 
sustain its Chinook salmon, and surveil- 
lance of deer to prevent the spread of 
chronic wasting disease (2). Despite the 
CRUs’ measurable successes, the Trump 
Administration’s proposed FY 2019 federal 
budget—the starting point for the budget 
that will take effect on 1 October—calls 
for the program’s elimination, closing 40 
units in 38 states and terminating more 
than 700 projects (3). If implemented, the 
proposed budget cut would have a dire 
effect on research and academic jobs, the 
U.S. economy, and the preservation of the 
country’s flora and fauna. 

University and state agency support,
facilitated by CRUs, multiplies the return
on the USGS’s modest investment in this 
wildlife science. Federal withdrawal of 
CRU funding would dissolve partnerships 
that provide office space, courtesy faculty 
appointments for unit leaders, access 
to graduate students, and state funding 
for CRU research that informs manage- 
ment of public lands. Because the faculty 
who lead the CRU research are USGS 
employees, withdrawal of federal funding 
support would result in the termination of 
faculty members scattered across 38 states 
and essentially shut down all the research 
projects they lead or oversee. 
Hunting, fishing, and other outdoor
recreation generate annual sales of $887 
billion, support 7.6 million American jobs, 
and contribute $65.3 billion in federal 
tax revenue (4). This economic engine is 
fueled in large part by the science that 
CRUs provide to state and federal agen- 
cies. Such research allows these agencies 
to manage the conservation and protec- 
tion of the wondrous natural settings on 
which current and future generations 
of hunters, fishers, hikers, and camp- 
ers depend. With the closure of the CRU 
program, we discard one of the most 
powerful tools we have for the long-term 
sustainability of the industries that rely 
on the responsible management of our 
natural resources. 

The loss of the CRU program would put 
U.S. ecosystems and species at risk. These 
cuts would halt research on some of the 
nation’s most iconic species: elk, moose, 
mountain lion, turkey, and largemouth 
bass (5). Science provided by CRUs also 
underpins decisions to classify species’ 
status (2), allowing for appropriate pro- 
tection measures to be taken. 

The American Fisheries Society and The 
Wildlife Society have sent a joint letter of 
concern to the U.S. Senate, explaining that 
the demise of CRUs would substantially 
reduce the ability to manage the nation’s 
fish and wildlife resources (6). I urge 
legislators and policy-makers to reject the 
administration’s proposal and instead to 
fully fund the CRUs. 

Jack M. Payne 
University of Florida Institute of Food and
Agricultural Sciences, Gainesville, FL 3261, USA.
Email: jackpayne@ufl.edu
A Taiwanese
student explores 
the properties 
of tough gels. 


OUTSIDE THE TOWER 


Science transcends cultures in Taiwan 


As my co-instructor, Sam, waxes lyrical about tough gels, I survey the hall: The timid
faces of 144 of the brightest students in Taiwan look back at us, uncertain of what to 
expect. It’s very different from the lively atmosphere we are accustomed to in New 
Zealand classrooms. 

Our team has traveled 10,000 km to participate in the Madame Curie Senior High 
School Chemistry Camp (1). I have longed to bring the science-through-play ideals of
our outreach program to the students of my home country. Many Taiwanese students 
grow up under immense academic pressure and rarely get to experience hands-on or 
carefree childhood activities. Today we intend to change that. 

“It's just for fun, so don't worry about getting everything right. Prepare to get 
messy!” I repeat Sam's words and sentiment in Mandarin, and then the students begin
experimenting with PVA/borax slime, modified using cornstarch to simulate a tough 
gel. Reserved smiles turn into joyous laughter as the slime stretches and dribbles 
off their fingers. The ice is broken! We move on to some “real” chemistry: measur- 
ing stretch in nano-clay composite tough gel samples. In this make-shift venue, the 
perceived formality of science in laboratories has been stripped away as we showcase 
Sam’s research. 

Play first; measurements later. We elected to reverse the usual order of experiments 
because, even more so than Kiwi kids, these students needed a fun and relaxed envi- 
ronment to facilitate engagement. Working with an understanding of their culture and 
language, we were able to show them that science is everywhere, and that anyone can 
do it and have fun along the way. 


Jacqueline Kao¹* and Sam P. H. Sutherland²

¹Science Knowledge Ltd., Taipei 11444, Taiwan. ²Department of Chemistry, University of Otago,
Dunedin, 9010, New Zealand.

*Corresponding author. Email: scienceknowledgeltd@gmail.com 
Kim et al., p. 998


Seeds of the small 
mustard plant 
Arabidopsis thaliana 
in their pod 


The global impacts of 
food production 


Food is produced and pro- 
cessed by millions of farmers 
and intermediaries globally, 
with substantial associated 
environmental costs. Given 

the heterogeneity of produc- 
ers, what is the best way to 
reduce food's environmental 
impacts? Poore and Nemecek 
consolidated data on the 
multiple environmental impacts 
of ~38,000 farms producing 
40 different agricultural goods
around the world in a meta- 
analysis comparing various 
types of food production sys- 
tems. The environmental cost 
of producing the same goods 
can be highly variable. However, 
this heterogeneity creates 
opportunities to target the small
numbers of producers that have 
the most impact. —AMS 

Science, this issue p. 987 


Predation favors the
unadventurous


Selection is likely to shape
behavior by acting on behavioral
differences between
individuals. Testing this idea has
been challenging. Lapiedra et
al. took advantage of a chain of
small islands in the Caribbean
colonized by anole lizards. A
series of repeated behavioral
selection experiments were
set up in which brown anole
populations were established
with and without predators. On
predator-free islands, animals
that were more exploratory
were favored, whereas when
predators were present, less
adventurous animals survived
better. Selection for behavior
occurred simultaneously with
morphological selection but was
predominant when predators
were present. —SNV

Science, this issue p. 1017


The Caribbean brown anole lizard,
Anolis sagrei


Controls on seed
dormancy


Herbivores and an inopportune
cold snap can
destroy fragile plant
seedlings. Plants control
the dormancy of their
seeds in anticipation of more
favorable growth conditions.
Chen and Penfield analyzed
the molecular controls on
seed dormancy in the model
plant Arabidopsis thaliana. Two
genes and an antisense RNA,
known from the process of vernalization,
integrate ambient
temperature to control seed
dormancy via their opposing
configurations. —PJH


Science, this issue p. 1014


Triggering quakes in a
geothermal space


Enhanced geothermal systems
(EGSs) provide a potentially
clean and abundant energy
source. However, two magnitude-5
earthquakes recently
occurred in South Korea during
EGS site development. Grigoli
et al. and Kim et al. present
seismic and geophysical evidence
that may implicate the
second of these earthquakes,
which occurred in Pohang, as
an induced event. The combination
of data from a local
seismometer network, well logs,
satellite observations, teleseismic
waveform analysis, and
stress modeling leads to the
assessment that the earthquake
was probably or almost
certainly anthropogenically
induced. The possibility remains
that the earthquake occurred
coincidentally at the EGS site
location, but the aftershock
distribution and other lines of
evidence are concerning for
future development of this geothermal
resource. —BG

Science, this issue p. 1003, p. 1007


POLITICAL SCIENCE 
Curtailed conversations 


Most articles written about U.S.
politics in the past few years
have mentioned the increasing
polarization of the electorate.
But is this real, or does it merely
reflect the increasing polarization
of the media? Chen and
Rohla estimate that in 2016,
Thanksgiving dinners in which
the hosts and guests lived in
oppositely voting precincts
were up to 50 minutes shorter
than same-party-precinct dinners.
That is, family members,
adjured to avoid talking about
contentious subjects, may
have simply talked less. —AMS
and GJC

Science, this issue p. 1020 


ORGANIC CHEMISTRY 
Arenes get a light boost 
onto copper 


Insertion of palladium into an
aryl halide bond is the first
step in numerous variants of
cross-coupling chemistry used
to make carbon-carbon bonds.
Copper is an appealing alternative
catalyst for such reactions
because of its abundance and
downstream reactivity profile.
However, this preliminary step,
termed oxidative addition, is
often prohibitively slow for the
cheaper metal. Le et al. report a
photocatalytic way around this
problem. A photoredox catalyst
paired with a silane can activate
aryl bromides to react with
copper, likely via aryl radicals.
The copper in this case then
catalyzes trifluoromethylation
of the arenes. —JSY
Science, this issue p. 1010


NEUROPROSTHETICS 
A leg up for
neuroprosthetics 


Amputation severs bone, 
nerves, and muscles needed 
for limb movement, limiting an 
amputee's ability to sense and 
control a prosthesis. Clites et al. 
tested autologous muscle-nerve 
interfaces made at the time of 
below-knee amputation in a 
human subject. In comparison 
with traditional amputations, 
the subject who received 
myoneural interfaces in his 
residuum (which were con- 
nected via synthetic electrodes 
to his powered prosthesis) 
had better control during stair 
walking. This person noted little 
delay between intention and 
movement of his prosthesis 
and expressed a strong sense 
that the prosthesis was part of 
him. —CC 

Sci. Transl. Med. 10, eaap8373 (2018). 


IMMUNE REGULATION 
Unexpected basophil 
activation 


Basophils are granulocytes 
that exist at a relatively rare 
frequency in the blood but 
are critical mediators of 
allergic and inflammatory 
responses. Regulatory T cells 
(Tregs) suppress the functions
of different immune cells. 
Sharma et al. examined how 
Tregs regulate basophil functions.
Unexpectedly, resting human 
basophils were activated and 
not suppressed in the presence 
of Tregs. The activated basophils
expressed activation markers— 
CD69, CD203c, and CD13—and 
released interleukin-4 (IL-4), 
IL-8, and IL-13. Treg-induced
activation of basophils involved 
IL-3 and STAT5 but was not 
contact-dependent. These 
counterintuitive results showing 
activation mediated by Tregs provide
insight into how basophils
are regulated. —CNF 

Sci. Immunol. 3, eaan0829 (2018).


IN OTHER JOURNALS 


ORGANIC SYNTHESIS 


Edited by Sacha Vignieri 
and Jesse Smith 


Electron density 

map of polycyclic 
aromatic hydrocarbons 
inside a zeolite cage 


Acetylene in, PAHs out 


Polyaromatic hydrocarbons (PAHs) are attractive for
optoelectronics applications, but their synthesis can
be challenging. Solution routes tend to have low yields,
and high-temperature vacuum pyrolysis (~1000°C)
generates numerous carbonaceous side products. Ko
et al. synthesized PAHs by polymerizing acetylene in Ca²⁺
ion-exchanged Linde type-A zeolites at ~400°C. The Ca²⁺
ions appear to lower the barrier for carbon-carbon bond
formation. The PAHs remained trapped in the α-cages of
the zeolite but were recovered after dissolving the zeolite
with acid. The product mixture exhibited white fluorescence
in N-methylpyrrolidone solution. Column chromatography
separation isolated violet, blue, green, and orange emitters,
with coronene being the main component of the blue
emitter. —PDS


J. Am. Chem. Soc. 10.1021/jacs.8b00900 (2018). 


NEURODEGENERATION 
Keeping stress granules 
in check 


Stress granules are membraneless
organelles composed of
RNAs and proteins that can
accumulate in the presence of
potentially pathogenic proteins.
These include proteins linked
to neurodegenerative disorders
such as C9-ALS/FTD (amyotrophic
lateral sclerosis and
frontotemporal dementia linked
to C9ORF72). The pathomechanisms
of these same disorders
can involve disruption of
nucleocytoplasmic transport.
Zhang et al. found that a
variety of components of the
nuclear import machinery were
sequestered in stress granules,
which reduced nucleocytoplasmic
transport. Inhibiting stress
granule assembly suppressed
these transport defects in cells,
and it also interfered with the
neurodegeneration process in
patient-derived iPS (induced
pluripotent stem cell-derived)
motor neurons and in a fly
C9-ALS disease model system.
—SMH


Cell 173, 958 (2018).


VERTEBRATE PALEONTOLOGY
Island life


Among modern faunas, a well-established phenomenon occurs as species evolve to live on
islands. Specifically, many species undergo dwarfism and other related changes after island
colonization, likely owing to resource limitation and restriction of landscapes. Csiki-Sava et al.
describe a multituberculate mammal from a Cretaceous island in Romania that displays
several unusual traits, notably a reduced brain size and domed skull. The authors argue that
these changes reflect adaptation to the island habitat of this group and, further, that a small brain
(relative to body size) may be a previously undescribed adaptation to insular living. —SNV


Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.1801143115 (2018).


Published by AAAS


HOST DEFENSE 
Bacteria restricted via 
C3-mediated autophagy 


The complement system 

has several important host- 
defense functions. Complement 
component C3, for example, 
can enhance phagocytosis, 
contribute to the bactericidal 
membrane attack complex, 

and initiate adaptive immune 
responses against invading 
microorganisms. Sorbara et 

al. uncovered another mecha- 
nism by which C3 can control 
pathogens. They observed inter- 
actions between the autophagy 
protein ATG16L1 and C3. In 
opsonized intracellular bacteria 
such as Listeria, this resulted 

in increased targeting to the 
autophagy system (xenophagy) 
and, in turn, greater autophagy-dependent
growth restriction.
Certain intracellular bacteria,
such as Shigella and
Salmonella, were able to escape
C3-mediated targeting via the
omptin proteases IcsP and PgtE,
which cleave complement components
including C3. Mouse
models reveal that C3-mediated 
autophagy-dependent restric- 
tion may be important in 
protecting host mucosal tissues 
during the early stages of 
Listeria infection. —STS 


Cell Host Microbe 23, 644 (2018). 


POLYMER COLLOIDS 
Structured in a flash 


Block copolymers will assem- 
ble into a range of complex 
structures driven both by 
thermodynamics, as defined 

by the chemical makeup of the 
blocks, and by the kinetics of 
the chain motion. When formed 
into sheets, these structures 
can range from those with 
regular periodicity to those with 
complex, tortuous, random 
interconnected phases. Grundy 
et al. developed a scalable 


process in which a feed stream 
containing polymers in a solvent 
contacts a counterstream 
based on a poor solvent for the 
polymers, which causes the 
polymers to precipitate out. 
Depending on the choice of 
polymers, one can get anything 
from simple Janus particles or 
layered ones to far more com- 
plicated, internally structured 
particles made by kinetic trap- 
ping of nonequilibrium phases. 
—MSL 
ACS Nano 10.1021/ 
acsnano.8b01260 (2018). 


EDUCATION 

A (dis)course in postdoc 
identities 

Adapting postdoctoral training 
to include the development of 
skills that are translatable to 
careers outside of the labora- 
tory is critical for diversifying 
the STEM (science, technology, 
engineering, and mathemat- 
ics) workforce. We do not know 
much about the scientific 
identities of postdoctoral 
researchers (postdocs), which
could ultimately limit their 
understanding of additional 
STEM career opportunities. 
Price et al. interviewed 30 
postdocs to better understand 
how their scientific identities 
influenced their career goals. 
Using discourse analysis, the 
research team categorized two 
primary scientific identities 
among the postdocs: (i) bench 
scientists, who implement the 
scientific visions of others, and 
(ii) principal investigators, who 
formulate their own scientific 
visions. A further understand- 
ing of how these identities 
develop is essential for imple- 
menting better professional 
development programs for 
postdocs as they embark on 
scientific careers. —MMc

CBE Life Sci. Educ. 10.1187/cbe.17-08-0177 (2018).


NEUROSCIENCE 
Overcoming working 


memory limitations 
The capacity of our work- 
ing memory is limited. We 
normally do not notice these 
limitations because we use 
a repertoire of behavioral 
strategies to overcome them. 
To investigate how these 
high-level strategies inter- 
act with working memory, 
Chiang and Wallis trained 
rhesus macaques to perform 
a search task and recorded 
from neurons in their lateral 
prefrontal cortex. Individual 
neurons encoded the location 
of the current search target, 
as well as the location of other 
targets several steps earlier or 
later in the search sequence. 
Both upcoming targets and 
previously visited targets were 
encoded well above chance. 
However, lateral prefrontal 
cortical neurons were more 
likely to encode upcoming tar- 
gets. Macaques spontaneously 
used behavioral strategies that 
helped them solve the task; 
this improved their perfor- 
mance while simultaneously 
reducing the neuronal load 
required. —PRS 

Proc. Natl. Acad. Sci. U.S.A. 

115, 5010 (2018). 
SINGLE-CELL ANALYSIS
Mapping the vertebrate 
developmental landscape 


As embryos develop, numerous 
cell types with distinct functions 
and morphologies arise from 
pluripotent cells. Three research 
groups have used single-cell 
RNA sequencing to analyze 
the transcriptional changes 
accompanying development of 
vertebrate embryos (see the 
Perspective by Harland). Wagner 
et al. sequenced the transcrip- 
tomes of more than 90,000 
cells throughout zebrafish 
development to reveal how cells 
differentiate during axis pattern- 
ing, germ layer formation, and 
early organogenesis. Farrell et al. 
profiled the transcriptomes of 
tens of thousands of embryonic 
cells and applied a computa- 
tional approach to construct a 
branching tree describing the 
transcriptional trajectories that 
lead to 25 distinct zebrafish 
cell types. The branching tree 
revealed how cells change their 
gene expression as they become 
more and more specialized. 
Briggs et al. examined whole 
frog embryos, spanning zygotic 
genome activation through 
early organogenesis, to map 
cell states and differentiation 
across all cell lineages over time. 
These data and approaches pave 
the way for the comprehensive 
reconstruction of transcriptional 
trajectories during development. 
—BAP 

Science, this issue p. 981, p. 979, 

p. 980; see also p. 967 


PLANETARY SCIENCE 
Methane ice dunes 
on Pluto 


Wind-blown sand or ice dunes 
are known on Earth, Mars, 
Venus, Titan, and comet 67P/ 
Churyumov-Gerasimenko. Telfer 
et al. used images taken by the 
New Horizons spacecraft to 
identify dunes in the Sputnik 
Planitia region on Pluto (see the 
Perspective by Hayes). Modeling 
shows that these dunes could be
formed by sand-sized grains of 
solid methane ice transported in 
typical Pluto winds. The methane 
grains could have been lofted 
into the atmosphere by the 
melting of surrounding nitrogen 
ice or blown down from nearby 
mountains. Understanding how 
dunes form under Pluto condi- 
tions will help with interpreting 
similar features found elsewhere 
in the solar system. —KTS 
Science, this issue p. 992; 
see also p. 960 


BIOMATERIALS 
I’ve got a feeling 


Sensory (or afferent) nerves 
bring sensations of touch, 
pain, or temperature variation 
to the central nervous system 
and brain. Using the tools and 
materials of organic electronics, 
Kim et al. combined a pressure 
sensor, a ring oscillator, and 
an ion gel-gated transistor to 
form an artificial mechanore- 
ceptor (see the Perspective by 
Bartolozzi). The combination 
allows for the sensing of multiple 
pressure inputs, which can be 
converted into a sensor signal 
and used to drive the motion of 
a cockroach leg in an oscillatory 
pattern. —MSL 

Science, this issue p. 998; 

see also p. 966 


HUMAN EVOLUTION 
Founder effects in modern 
populations 


The genomes of ancient humans 
can reveal patterns of early 
human migration (see the 
Perspective by Achilli et al.). 
Iceland has a genetically distinct 
population, despite relatively 
recent settlement (~1100 years 
ago). Ebenesersdóttir et al. examined
the genomes of ancient
Icelandic people, dating to near 
the colonization of Iceland, and 
compared them with modern- 
day Icelandic populations. The 
ancient DNA revealed that the 
founders had Gaelic and Norse 
origins. Genetic drift since the
initial settlement has left modern 
Icelanders with allele frequen- 
cies that are distinctive, although 
still skewed toward those of 
their Norse founders. Scheib et 
al. sequenced ancient genomes 
from the Channel Islands of 
California, USA, and Ontario, 
Canada. The ancient Ontario 
population was similar to other 
ancient North Americans, as 
well as to modern Algonquian- 
speaking Native Americans. In 
contrast, the California individu- 
als were more like groups that 
now live in Mexico and South 
America. It appears that a genetic 
split and population isolation 
likely occurred during the Ice Age, 
but the peoples remixed at a later 
date. —LMZ 

Science, this issue p. 1028, p. 1024;

see also p. 964 


PESTICIDES AND SOCIETY 
Glyphosate: Menace 


or savior? 


The herbicide glyphosate is used 
in numerous products, includ- 
ing Roundup, to kill weeds that 
compete with crops and to dry 
out crops, making them easier 
to harvest. In a Perspective, van 
Straalen and Legler highlight the 
recent decision by the European 
Union to reapprove glyphosate 
for another 5 years. Although 
several assessments have con- 
cluded that glyphosate does not 
pose a danger to humans at the 
concentrations typically encoun- 
tered, these assessments do not 
account for the combinatorial 
effects of other chemicals in the 
herbicide products. Because little 
is known about how glyphosate 
affects ecosystems, increasing 
its use is worrisome. Societal 
concerns about reliance on 
pesticides need to be considered 
through socioeconomic impact 
analysis as part of the decision- 
making process. —JFU 

Science, this issue p. 958 
CANCER METABOLISM
Metabolic plasticity foils 
drug development 


The metabolism of cancer cells 
is increasingly being targeted 
in drug development, but the 
results in patients are not 
always consistent with pre- 
clinical models. In a Perspective, 
Muir and Vander Heiden discuss 
the importance of nutrient avail- 
ability in the microenvironment 
as a determinant of responses 
to drugs that target metabolism 
and how metabolic targets in 
cancer cells might be identified 
more effectively. —GKA 

Science, this issue p. 962 


ECOLOGY 
Reducing bycatch while 


sustaining harvests 


Reducing bycatch—the capture 
of nontarget marine spe- 
cies—remains challenging for 
global fisheries, which provide 
essential sources of protein 
for billions of people. Even in 
well-managed fisheries, the 
ensnaring of species such as 
turtles, seabirds, and sharks is 
a problem. For many species, 
the solutions used to mitigate 
bycatch have not been success- 
ful. Focusing on the California 
drift gillnet fishery, Hazen et 
al. developed a multispecies 
ocean management approach 
that combines several types 
of historical data on multiple 
species’ movement to predict 
daily catch. With this tool, they 
discovered that operating the 
California drift gillnet fishery on 
the basis of daily oceanographic 
conditions, instead of more 
static approaches based on 
rigid boundaries, could greatly 
reduce bycatch while maintain- 
ing fish harvests. —PJB 

Sci. Adv. 10.1126/sciadv.aar3001 

(2018). 
ACTUATORS
Strength and stability 
through opposition 


Biohybrid robots that combine 
living muscles with synthetic 
skeletons can be limited in their 
utility by the muscles sponta- 
neously shrinking over time. 
Morimoto et al. used antago- 
nistic pairs of skeletal muscle 
tissues to prevent this shrinkage 
and extend the robot's life span. 
They attached myoblast-laden 
hydrogel sheets to both sides 

of a jointed resin skeleton; the 
sheets formed into antagonistic 
muscles after 10 days of culture. 
The resulting robot achieved 
large actuation over a long 
lifetime and used finger-like 
movements to pick and place a 
ring onto a pillar. —RLK 


Sci. Robot. 3, eaat4440 (2018). 
SINGLE-CELL ANALYSIS


Single-cell reconstruction of 
developmental trajectories during 
zebrafish embryogenesis 


Jeffrey A. Farrell,* Yiqun Wang,* Samantha J. Riesenfeld, Karthik Shekhar,
Aviv Regev,† Alexander F. Schier†


INTRODUCTION: During embryogenesis, plu- 
ripotent cells gradually become specialized and 
acquire distinct functions and morphologies. Be- 
cause much of the specification process is con- 
trolled through changes in gene expression, the 
identification of the transcriptional trajectories 
underlying cell fate acquisition is paramount to 
understanding and manipulating development. 


RATIONALE: Traditional approaches have 
studied specific fate decisions by analyzing the 
transcription of a few selected marker genes or 
by profiling isolated, predefined cell populations. 
The advent of large-scale single-cell RNA sequencing
(scRNA-seq) provides the means to
comprehensively define the gene expression 
states of all embryonic cells as they acquire 
their fates. This technology raises the possibility 
of identifying the molecular trajectories that 
describe cell fate specification by sampling 
densely during embryogenesis and connecting 
the transcriptomes of cells that have similar gene 
expression profiles. However, the numerous 
transcriptional states and branch points, as well 
as the asynchrony in developmental processes, 
pose major challenges to the computational 
reconstruction of developmental trajectories 
from scRNA-seq data. 
Embryonic development


Developmental tree of early zebrafish embryogenesis. Single-cell transcriptomes were generated 
from zebrafish embryos at 12 developmental stages (six of which are shown). The transcriptional 
trajectories that describe the fate specification of 25 cell types were reconstructed from the data. 
Molecular specification is visualized with a force-directed layout, in which each cell is represented by 
a point (colored by developmental stage), proceeding from pluripotent cells (at the bottom center) 
outward to 25 distinct cell types. A subset of the identified trajectories are labeled in groups. 
RESULTS: We generated single-cell transcriptomes
from 38,731 cells during early zebrafish
embryogenesis at high temporal resolution, 
spanning 12 stages from the onset of zygotic 
transcription through early somitogenesis. We 
took two complementary approaches to identify 
the transcriptional trajectories in the data. First, 
we developed a simulated diffusion-based compu- 
tational approach, URD, which identified the tra- 
jectories describing the specification of 25 cell 
types in the form of a branching tree. Second, 
we identified modules of coexpressed genes and 
connected them across developmental time. Combining
the reconstructed developmental trajectories
with differential gene expression analysis
uncovered gene expression cascades leading
to each cell type, including
previously unidentified mark- 
ers and candidate regulators. Combining these 
trajectories with Seurat, which infers the spatial 
positions of cells on the basis of their transcrip- 
tomes, connected the earlier spatial position of 
progenitors to the later fate of their descendants. 
Inspection of the developmental tree led to 
new insights about molecular specification in 
zebrafish. For example, the first branch point in 
the tree indicated that the first molecular spec- 
ification event may not only separate the germ 
layers but also define the axial versus nonaxial 
mesendoderm. Additionally, some developmen- 
tal branch points contained intermediate cells 
that expressed genes characteristic of multiple 
downstream cell fates. Gene expression analysis 
at one such branch point (the axial mesoderm) 
suggested that the intermediate cells switch their 
specification from one fate (notochord) to another 
(prechordal plate). Last, analysis of single-cell 
transcriptomes from a Nodal-signaling mutant 
revealed that even at the whole-transcriptome 
level, mutant cells were canalized into a subset 
of wild-type states and did not adopt any tran- 
scriptional states not observed in wild type, de- 
spite abnormal developmental signaling. 


CONCLUSION: These findings reconstruct the 
gene expression trajectories during the embryo- 
genesis of a vertebrate and highlight the con- 
current canalization and plasticity of cell type 
specification. The scRNA-seq data and developmental
tree provide a rich resource for future
studies in zebrafish: The raw and processed data 
and the URD software are available for down- 
load, and the data can be browsed interactively 
online. Last, this approach provides a broadly 
applicable framework with which to reconstruct 
complex developmental trajectories from single- 
cell transcriptomes. 
*These authors contributed equally to this work.
†Corresponding author. Email: aregev@broadinstitute.org
(A.R.); schier@fas.harvard.edu (A.F.S.) 

Cite this article as J. A. Farrell et al., Science 360, eaar3131 
(2018). DOI: 10.1126/science.aar3131 




RESEARCH 


RESEARCH ARTICLE 


SINGLE-CELL ANALYSIS 


Single-cell reconstruction of 
developmental trajectories during 
zebrafish embryogenesis 


Jeffrey A. Farrell,* Yiqun Wang,* Samantha J. Riesenfeld, Karthik Shekhar,
Aviv Regev,† Alexander F. Schier†


During embryogenesis, cells acquire distinct fates by transitioning through transcriptional 
states. To uncover these transcriptional trajectories during zebrafish embryogenesis, we 
sequenced 38,731 cells and developed URD, a simulated diffusion-based computational 
reconstruction method. URD identified the trajectories of 25 cell types through early 
somitogenesis, gene expression along them, and their spatial origin in the blastula. Analysis 
of Nodal signaling mutants revealed that their transcriptomes were canalized into a subset 
of wild-type transcriptional trajectories. Some wild-type developmental branch points 
contained cells that express genes characteristic of multiple fates. These cells appeared to 
trans-specify from one fate to another. These findings reconstruct the transcriptional 
trajectories of a vertebrate embryo, highlight the concurrent canalization and plasticity of 
embryonic specification, and provide a framework with which to reconstruct complex 
developmental trees from single-cell transcriptomes. 


During embryogenesis, a single totipotent
cell gives rise to numerous cell types with
distinct functions, morphologies, and spatial
positions. Because this process is primarily
controlled through transcriptional
regulation, the identification of the transcrip- 
tional states underlying cell fate acquisition is 
paramount to understanding and manipulating 
development. Previous studies have presented 
different views of cell fate specification. For ex- 
ample, artificially altering transcription factor 
expression (such as in reprogramming) has revealed
remarkable plasticity of cellular fates (1-3).
Conversely, classic embryological studies have in- 
dicated that cells are canalized to adopt perduring 
fates separated by epigenetic barriers. Techno- 
logical limitations necessitated that traditional 
embryological studies focus on specific fate de- 
cisions with selected marker genes, but the ad- 
vent of single-cell RNA sequencing (scRNA-seq) 
raises the possibility of fully defining the transcriptomic
states of embryonic cells as they acquire
their fates (4-8). However, the large number
of transcriptional states and branch points, as well 
as the asynchrony in developmental processes, 
pose major challenges to the comprehensive iden- 
tification of cell types and the computational 
reconstruction of their developmental trajectories. 
Pioneering computational approaches to uncover 
developmental trajectories (5-7, 9-11) were either 
designed to address stationary or steady-state 
processes or accommodate only small numbers 
of branch points and thus are insufficient for 
addressing the complex branching structure of 
time-series developmental data. We addressed 
these challenges by combining large-scale single- 
cell transcriptomics during zebrafish embryo- 
genesis with the development of a new simulated 
diffusion-based computational approach to re- 
construct developmental trajectories, called URD 
(named after the Norse mythological figure who 
nurtures the world tree and decides all fates). 


High-throughput scRNA-seq 
from zebrafish embryos 


We profiled 38,731 cells from 694 embryos across 
12 closely spaced stages of early zebrafish de- 
velopment using Drop-seq, a massively parallel 
scRNA-seq method (12). Samples spanned from
high blastula stage (3.3 hours postfertilization, 
just after transcription from the zygotic genome 
begins), when most cells are pluripotent, to six- 
somite stage (12 hours postfertilization, shortly 
after the completion of gastrulation), when many 
cells have differentiated into specific cell types 
(Fig. 1A and table S1). In a t-distributed stochastic
neighbor embedding (tSNE) plot (13) of the entire
data set based on transcriptional similarity,
it is evident that developmental time was a strong
source of variation in the data, but the underlying 
developmental trajectories were not readily ap- 
parent (Fig. 1B). Consistent with the understand- 
ing that cell types become more transcriptionally 
divergent over time, cells from early stages formed 
large continuums in the tSNE plot, whereas more 
discrete clusters emerged at later stages (Fig. 1C). 


URD reconstructs complex branching 
developmental trajectories 


Acquisition of many single-cell embryonic tran- 
scriptomes with high temporal resolution created 
the possibility of reconstructing developmental 
trajectories through similarity in gene expression 
profiles. Such an approach would allow the in- 
vestigation of the gene expression dynamics and 
the timing of molecular specification—when pro- 
genitor populations become transcriptionally dis- 
tinct from each other and begin to express the 
regulators that will drive their future fates. There- 
fore, we developed URD, an approach to uncover 
complex developmental trajectories as a branch- 
ing tree. URD extends diffusion maps [originally 
presented for single-cell differentiation analysis 
in pioneering work by Haghverdi et al. and their 
R package, destiny (9, 10)] through several ad- 
vances: It introduces a new way to order cells in 
pseudotime, finds developmental trajectories, dis- 
covers an underlying branching tree that abstracts 
specification, and visualizes the data (Fig. 1D and
supplementary materials, materials and methods). 

In general, URD operates by means of “sim- 
ulating diffusion,” using discrete random walks 
and graph searches to approximate the continuous 
process of diffusion (supplementary materials, ma- 
terials and methods). Briefly, URD constructs a 
k-nearest-neighbor graph between transcriptomes 
in gene expression space; graph edges are assigned 
transition probabilities that are used as weights 
in later simulations and describe the chance 
a random walk would move along each edge 
(Fig. 1D, 1) (9, 10). The user identifies the root(s) 
(starting points) and tips (end points) of the de- 
velopmental process. Cells are next assigned a 
pseudotime—an ordering that should reflect their 
developmental progress rather than absolute 
time—in order to compensate for developmental 
asynchrony. URD calculates pseudotime by sim- 
ulating diffusion from the root to determine 
each cell’s distance from the root (as the average 
number of diffusive transitions needed to reach 
it across several simulations) (Fig. 1D, 2). Next, 
the developmental trajectory (the path in gene 
expression) between each tip and the root is de- 
termined by identifying which cells are visited 
by simulated biased random walks initiated in 
that tip; the walks are biased to only transition 
to cells of equal or earlier pseudotime so that 
when they reach developmental branch points, 
they proceed toward the root and do not ex- 
plore other cell types (Fig. 1D, 3). Then, URD re- 
constructs a branching tree structure by joining 
pairs of trajectories where they pass through the 
same cells (Fig. 1D, 4; black and purple edges, for 
example). Last, the data are visualized with a 
force-directed layout based on cells’ visitation 
frequency by the random walks from each tip
(Fig. 1D, 5) (14). The developmental trajectories 
identified with URD are akin to cell lineages but 
differ from classical definitions of cell lineage be- 
cause they are reconstructed from observed gene 
expression and do not measure mother-daughter 
relationships between cells. URD does not re- 
quire any prior knowledge of the developmen- 
tal trajectories it seeks to find (such as the 
number of branch points or definition of inter- 
mediate states). 
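
The steps described above can be sketched in a few lines of Python. This is a minimal illustration, not the URD implementation (URD is distributed as an R package and builds on diffusion maps). The function names, the Gaussian edge weighting, the hard "equal or earlier" cutoff in the biased walk, and all parameter values are assumptions made for this example.

```python
import numpy as np

def knn_transition_matrix(X, k=2, sigma=1.0):
    """Row-stochastic transition probabilities on a k-nearest-neighbor graph.

    X is a (cells x features) array. The Gaussian kernel and sigma are
    illustrative assumptions, not URD's exact weighting.
    """
    n = X.shape[0]
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    np.fill_diagonal(d2, np.inf)              # a cell is not its own neighbor
    W = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(d2[i])[:k]
        W[i, nbrs] = np.exp(-d2[i, nbrs] / (2.0 * sigma ** 2))
    W = np.maximum(W, W.T)                    # make the graph undirected
    return W / W.sum(axis=1, keepdims=True)

def flood_pseudotime(P, root, n_sim=50, max_steps=10_000, seed=0):
    """Average number of simulated transitions needed to reach each cell."""
    rng = np.random.default_rng(seed)
    n = P.shape[0]
    total = np.zeros(n)
    for _ in range(n_sim):
        visited = np.zeros(n, dtype=bool)
        visited[root] = True
        reached_at = np.full(n, np.nan)       # unreachable cells stay NaN
        reached_at[root] = 0
        for step in range(1, max_steps + 1):
            if visited.all():
                break
            # Chance that at least one visited cell transitions into each cell.
            p_hit = 1.0 - np.prod(1.0 - P[visited], axis=0)
            newly = ~visited & (rng.random(n) < p_hit)
            reached_at[newly] = step
            visited |= newly
        total += reached_at
    return total / n_sim

def biased_walk(P, pseudotime, tip, root, rng, max_steps=10_000):
    """Random walk from a tip that may only step to equal or earlier pseudotime."""
    path = [tip]
    current = tip
    for _ in range(max_steps):
        if current == root:
            break
        weights = P[current] * (pseudotime <= pseudotime[current])
        if weights.sum() == 0:
            break                             # dead end: no earlier neighbor
        current = int(rng.choice(len(weights), p=weights / weights.sum()))
        path.append(current)
    return path

def visitation_frequency(P, pseudotime, tip, root, n_walks=200, seed=0):
    """Fraction of biased walks from one tip that pass through each cell."""
    rng = np.random.default_rng(seed)
    visits = np.zeros(P.shape[0])
    for _ in range(n_walks):
        path = biased_walk(P, pseudotime, tip, root, rng)
        visits[np.unique(path)] += 1
    return visits / n_walks

# Toy data: 20 cells on a line, root at cell 0 and a single tip at cell 19.
X = np.arange(20, dtype=float)[:, None]
P = knn_transition_matrix(X, k=2)
pt = flood_pseudotime(P, root=0)
freq = visitation_frequency(P, pt, tip=19, root=0)
```

In this toy example, pseudotime grows with distance from the root and every walk from the tip ends at the root. With several tips, trajectories could then be joined wherever their visitation frequencies overlap, which is the idea behind the tree-building step.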


Reconstructed developmental tree 
recapitulates molecular specification 
during zebrafish embryogenesis 


Application of URD to the early zebrafish em- 
bryogenesis scRNA-seq data generated a tree 
whose branches reflected embryonic specifica- 
tion trajectories. To define the final cell popula- 
tions (the tips of the tree), we clustered cells 
from the final stage of our time course and de- 
termined cluster identity through the expression 
of known marker genes (fig. S1). The recovered
tree followed the specification of 25 final cell 
populations across 16 branch points (Fig. 1E, 


Fig. 1. Generation of a developmental
specification tree for early zebrafish
embryogenesis using URD. (A) Single-cell
transcriptomes were collected from zebrafish
embryos at 12 developmental stages (colored
dots) spanning 3.3 to 12 hours postfertilization
(hpf). (B) tSNE plot of the entire data, colored
by stage [as in (A)]. Developmental time is a
strong source of variation, and the underlying
developmental trajectories are not immediately
apparent. (C) tSNE plot of data from two stages
(top, 50% epiboly; bottom, six-somite). Clusters
are more discrete at the later stage. (D) URD's
approach for finding developmental trajectories.
1: Transition probabilities are computed from
the distances between transcriptomes and used
to connect cells with similar gene expression.
2: From a user-defined “root” (such as cells of
the earliest time point), pseudotime is calculated
as the average number of transitions required to
reach each cell from the root. 3: Trajectories
from user-defined “tips” (such as cell clusters in
the final time point) back to the root are
identified by simulated random walks that are
biased toward transitioning to cells younger or
equal in pseudotime. 4: To recover an underlying
branching tree structure, trajectories are joined
agglomeratively at the point where they contain
cells that are reached from multiple tips.
5: The data are visualized with a force-directed
layout based on cells’ visitation frequency by
the random walks from each tip. (E) Force-directed
layout of early zebrafish embryogenesis,
optimized for 2D visualization (supplementary
materials, materials and methods, fig. S2,
and movie S1), colored by stage [as in (A)] with
terminal populations labeled. EVL, enveloping
layer; P, placode; Adeno., adenohypophyseal;
Trig., trigeminal; Epi., epibranchial; Panc.+Int.,
pancreatic + intestinal; RBI, rostral blood island;

fig. S2, and movie S1). The reconstructed tree 
substantially recapitulated the developmental 
trajectories expected from classical embryological 
studies (15-17). For example, the primordial germ 
cells, the enveloping layer cells (EVLs), and the 
deep-layer blastomeres already formed separate 
trajectories by high stage (Fig. 1F). Unexpectedly, 
the first branch point within the blastoderm not 
only separated the ectoderm from the mesendo- 
derm but also divided the axial mesoderm from 
the remainder of the mesendoderm (Fig. 1G). 
Later branching events were also recovered, such 
as the separation of paraxial, lateral, and inter- 
mediate mesoderm (Fig. 1H); the separation of 
the non-neural and neural ectoderm (Fig. 1I); and
the eventual branching of the non-neural ecto- 
derm into epidermis, neural plate border, and 
multiple preplacodal ectoderm trajectories (Figs. 
1E and 2, trajectory, and fig. S3). Displaying the 
expression of classic marker genes on the de- 
velopmental tree highlighted expected trajectories 
and confirmed their annotation (Fig. 2, gene 
expression, and fig. S3). For example, consistent 
with its known expression, the notochord marker 
gene noto was restricted primarily to two trajectories
(the entire notochord trajectory and
the later stages of the tailbud trajectory) and 
confirmed that the branch point between the 
notochord and prechordal plate was correctly 
placed (18). These results show that URD can 
reconstruct the highly complex branching tra- 
jectories of early zebrafish embryogenesis solely 
on the basis of large-scale scRNA-seq data. 


Connected gene modules 
support reconstructed 
developmental trajectories 


To complement the specification tree, and in order 
to find groups of genes that are coexpressed 
within cell populations, we applied non-negative 
matrix factorization (NMF) to the single-cell tran- 
scriptomes (19). This approach produced modules 
of covarying genes and described cells in terms 
of module expression (which is more robust than 
individual gene expression measurements) (4, 20). 
Modules were annotated post hoc according 
to their highly ranked classic marker genes, 
and similar modules from adjacent stages were 
linked to each other according to the overlap of 
their most highly ranked genes. This approach 
created chains of connected gene modules that
provided an alternative way to track developmental
trajectories (fig. S4 and table S2). For
example, the prechordal plate chain of connected 
gene modules extended from 50% epiboly to 
six-somite stage, during which the top ranking 
genes gradually changed from early to late markers 
for the prechordal plate (table S3). 
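
As a rough illustration of this module analysis, the sketch below factors a toy expression matrix with NMF and links modules from adjacent stages by the overlap of their top-ranked genes. It is a simplified stand-in, not the authors' pipeline: the multiplicative-update solver, the Jaccard overlap score, the 0.5 threshold, the gene names, and the weight matrices are all assumptions chosen for the example.

```python
import numpy as np

def nmf(V, k, n_iter=500, eps=1e-9, seed=0):
    """Factor a non-negative (genes x cells) matrix as V ~ W @ H.

    Uses multiplicative updates for the squared-error objective. W holds
    gene weights per module; H holds module expression per cell.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], k)) + eps
    H = rng.random((k, V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def top_genes(W, gene_names, n_top=3):
    """Set of the highest-weighted genes for each module (column of W)."""
    return [set(gene_names[i] for i in np.argsort(W[:, m])[::-1][:n_top])
            for m in range(W.shape[1])]

def link_modules(early, late, min_overlap=0.5):
    """Link modules of adjacent stages whose top-gene sets overlap (Jaccard)."""
    links = []
    for i, a in enumerate(early):
        for j, b in enumerate(late):
            jaccard = len(a & b) / len(a | b)
            if jaccard >= min_overlap:
                links.append((i, j, jaccard))
    return links

genes = ["g0", "g1", "g2", "g3", "g4", "g5"]       # hypothetical gene names

# Factor a toy genes x cells matrix into two modules.
V = np.random.default_rng(1).random((6, 30))
W, H = nmf(V, k=2)

# Hand-made gene weights for two adjacent stages, to show the linking step.
W_early = np.array([[5, 0], [4, 0], [3, 0], [0, 5], [0, 4], [0, 3]], float)
W_late = np.array([[4, 0], [5, 0], [0, 1], [0, 5], [3, 0.5], [0, 4]], float)
links = link_modules(top_genes(W_early, genes), top_genes(W_late, genes))
```

Here each early module is linked to the one late module that shares two of its three top genes. Repeating the linking step across every pair of adjacent stages would yield chains of connected modules.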

The URD-generated developmental tree and 
the chains of connected gene modules provided 
two different ways to analyze the scRNA-seq data 
and define developmental trajectories. To deter- 
mine how congruent these approaches are, we 
highlighted cells in the developmental tree ac- 
cording to their expression of connected gene 
modules. Cells that express connected gene 
modules occupied specific URD-recovered de- 
velopmental trajectories, further supporting 
the structure of the developmental tree re- 
constructed by URD (Fig. 2, module expression, 
and fig. S3).


Gene cascades reveal 
expression dynamics along 
developmental trajectories 


Gene expression and gene module analysis were 
combined in order to find candidate regulators 
and markers along each trajectory uncovered with 
URD. Genes and connected gene modules were 
associated with developmental trajectories by 
testing for their differential expression down- 
stream of each branch point of URD’s recovered 
branching structure (Fig. 3, A and B; and supplementary
materials, materials and methods).
Gene expression dynamics were then fit with an
impulse model (21) so as to determine the onset
and offset time of their expression, which was then 
used to order genes. As an example, sequential 
expression was observed for 197 genes enriched 
in the prechordal plate during its specification, 
including several well-known transcription factors
or signaling molecules that confirm the validity
of our approach (such as gsc, foxa3, klf17,
and frzb) (Fig. 3C and figs. S5 to S7) (22-25). 
This cascade contained several regulatory factors 
(such as fzd8a, fzd8b, mllt1b, and inhbaa) without
described roles in the prechordal plate that
would be candidates for reverse genetic screens. 
This cascade also contained more than 40 genes 
that were not previously annotated as associated 
with the prechordal plate (fig. S6), and those 
tested by means of in situ hybridization were indeed
expressed in the prechordal plate (fig. S7).
Thus, combining URD and gene module analysis 
uncovered the transcriptional cascades that ac- 
companied the development of progenitors into 
differentiated cells and highlighted both previously 
characterized and newly identified trajectory- 
enriched genes. 
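
The ordering of genes by fitted onset time can be illustrated with a short Python sketch. It uses one common double-sigmoid parameterization of an impulse curve; whether this matches the exact form and fitting procedure used here is an assumption. The grid search, the fixed slope `beta`, and the gene names and parameter values are also hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def impulse(t, h0, h1, h2, t_on, t_off, beta):
    """Double-sigmoid impulse: level h0 -> h1 at t_on, then h1 -> h2 at t_off.

    Requires a non-zero peak level h1.
    """
    rise = h0 + (h1 - h0) * sigmoid(beta * (t - t_on))
    fall = h2 + (h1 - h2) * sigmoid(-beta * (t - t_off))
    return rise * fall / h1

def fit_onset_offset(t, y, beta=5.0, n_grid=41):
    """Grid-search the onset and offset times that minimize squared error.

    Plateau levels are read off the data (first value, maximum, last value),
    a simplification that assumes a single pulse of expression.
    """
    h0, h1, h2 = y[0], y.max(), y[-1]
    grid = np.linspace(t.min(), t.max(), n_grid)
    best = (np.inf, None, None)
    for t_on in grid:
        for t_off in grid[grid > t_on]:
            err = np.sum((impulse(t, h0, h1, h2, t_on, t_off, beta) - y) ** 2)
            if err < best[0]:
                best = (err, t_on, t_off)
    return best[1], best[2]

t = np.linspace(0.0, 10.0, 101)                     # toy pseudotime axis
curves = {                                          # hypothetical genes
    "gene_late": impulse(t, 0.0, 1.0, 0.2, 5.0, 8.0, 5.0),
    "gene_early": impulse(t, 0.0, 1.0, 0.0, 2.0, 6.0, 5.0),
}
fits = {g: fit_onset_offset(t, y) for g, y in curves.items()}
cascade = sorted(fits, key=lambda g: fits[g][0])    # order genes by onset
```

Sorting by the fitted onset places `gene_early` before `gene_late`, which is the kind of ordering used to lay out an expression cascade along pseudotime.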


Combining developmental trajectories 
with spatial analysis infers 
progenitor locations 


The URD-generated tree is a powerful way to 
visualize developmental trajectories but lacks 
spatial information. We therefore asked whether 
Fig. 2. Developmental trajectories, genes, and connected gene modules overlaid on the
force-directed layout. From top to bottom, (i) the trajectories identified by URD from the root to a
given population (or group of populations), (ii) gene expression of a classical marker of that population, and
(iii) expression of a six-somite gene module active in the population and its connected modules from
earlier stages. The remainder are presented in fig. S3.
the trajectories could be traced to their spatial
origin at the late blastula stage (16, 17). First, a
spatial map of the Drop-seq 50% epiboly tran- 
scriptomes was generated by using Seurat, a 
method we previously developed to infer the spa- 
tial locations of single-cell transcriptomes by com- 
paring the genes expressed in each transcriptome 
with the spatial expression patterns of a few 
landmark genes obtained from RNA in situ hy- 
bridization (20). Second, Seurat’s spatial map 
was combined with either URD or connected 
gene module analysis (as parallel, independent 
approaches) in order to associate cell popula- 
tions at six-somite stage with the location of 
their “pseudoprogenitors” at 50% epiboly. In 
one approach, we used URD’s simulated random 
walks from cell populations at six-somite (Fig. 3A, 
pink bars) to infer their pseudoprogenitor cells 
and then plotted the spatial location of the 50% 
epiboly pseudoprogenitors using Seurat (Fig. 
4A). In the other approach, we plotted the spa- 
tial expression of each 50% epiboly gene module 
and identified its connected gene modules at 
later stages (Fig. 3B, blue bars on right); the cells 
that express these connected gene modules are 
highlighted on the developmental tree (Fig. 4B) 
and are the “pseudodescendents” that arise from 
the spatial domain identified by the 50% epiboly 
gene module. 

The results from the two approaches were 
highly concordant and agreed with classic fate- 
mapping experiments (16, 17). For example, both 
approaches associated the dorsal margin of the 
50% epiboly embryo with axial mesodermal fates 
(prechordal plate and notochord). Likewise, in 
both cases the animal pole was associated with 
ectodermal fates, with the ventral animal side 
biased to non-neural fate and the dorsal animal 
side to neural fate. Overall, these results show 
that scRNA-seq data can be used not only to 
reconstruct specification trajectories and their 
associated gene cascades but also to connect 
the earlier spatial position of progenitors to the 
later fate of their descendants. 


Nodal signaling mutant cells are 
canalized into a subset of wild-type 
transcriptional states 


Previous studies of mutant embryos have pro- 
vided important insights into embryonic fate 
specification, but scRNA-seq raises the possibility 
of rapidly phenotyping mutants both genome- 
wide and at single-cell resolution. To test this 
idea, we profiled maternal-zygotic one-eyed pinhead 
mutants (MZoep), which lack the coreceptor for 
the mesendoderm inducer Nodal (26, 27). We 
first asked whether previous MZoep embryological 
results could be reconstructed simply from scRNA- 
seq data. Transcriptomes were generated with 
Smart-seq2 (28), with deeper mRNA coverage 
than that of Drop-seq (fig. S8); 325 MZoep and 
1047 wild-type transcriptomes were collected at 
50% epiboly, when Nodal signaling is normally 
active at the blastoderm margin. Using Seurat, we 
inferred the spatial origin of both wild-type and 
MZoep transcriptomes based on a wild-type land- 
mark map (Fig. 5A). As predicted, no MZoep cells 
mapped to the margin of a wild-type embryo,
where mesendodermal progenitors arise. Apply- 
ing NMF revealed that in MZoep mutants, 
expression of the marginal dorsal, dorsal, and 
marginal gene modules is greatly reduced or ab- 
sent, but no mutant-specific modules were found 
(Fig. 5B). NMF recovered the same 50% epiboly 
gene modules from Smart-seq and Drop-seq data 
(Fig. 5B and table S4), which allowed us to use 
the chains of connected Drop-seq gene modules 
to predict the MZoep mutant phenotype at later 
stages. Namely, our approach successfully pre- 
dicted the loss of paraxial mesoderm, ventrolateral 
mesoderm, axial mesoderm, and endoderm (Fig. 
5C), as well as the continued presence of the 
tailbud, as found in previous embryological 
studies (26). Analysis of additional single-cell 
transcriptomes from wild-type and MZoep mu- 
tants at six-somite stage largely verified our pre- 
dictions (fig. S9 and table S5). Together, these 
results show that combining modest-scale scRNA- 
seq in mutants with the large-scale developmental 
reference tree constructed for wild type can rap- 
idly provide phenotypic insights: In the absence 
of Nodal signaling, marginal blastomeres that 
would normally become mesendodermal pro- 
genitors instead become ectodermal and tail 
progenitors, resulting in the absence of mesen- 
dodermal cell types and an altered fate map, as 
shown in previous cell-tracing experiments (27). 
We next asked whether the mutant scRNA-seq 
dataset could provide novel insights into cell 
identity in the absence of Nodal signaling. Mor- 
phological analysis and fate mapping of MZoep 
mutants have found that all cells appear to adopt
wild-type fates (26, 27). However, intricate inter- 
actions exist between the several signaling path- 
ways active during early development, and the 
elimination of Nodal signaling changes levels 
and domains of other developmental signals in 
the embryo. Thus, MZoep mutant cells could po- 
tentially perceive signaling input combinations 
that do not occur in wild-type embryos, which 
may result in gene expression states not observed 
in wild type. We therefore wondered whether 
mutant cells expressed previously unknown com- 
binations of gene modules under this altered 
signaling landscape or were transcriptionally 
equivalent to a subset of wild-type states. Co- 
clustering of wild-type and MZoep transcriptomes 
according to their gene module expression re- 
vealed that while some mesendodermal cell types 
were absent in MZoep at 50% epiboly, the remain- 
ing cells clustered with wild-type states (Fig. 5D 
and fig. S10). This result indicates that even on the 
whole-transcriptomic and single-cell level, mutant 
cells are canalized into a subset of wild-type fates 
after the loss of an essential signaling pathway. 


Hybrid gene expression states reveal 
developmental plasticity 


Inspection of the developmental tree revealed 
that most cells fell along tight trajectories, but 
some cells were located in intermediate zones 
between branches. This observation seemed at 
odds with the view that embryonic cells traverse 
a developmental trajectory until a branch point 
Fig. 3. Association of developmental trajectories with temporal gene expression patterns. (A) The
underlying branching structure found by URD. Pink bars demarcate collections of cell types used in
Fig. 4A. (B) The structure of connected gene modules. Each circular node represents a module and is
colored by the developmental stage from which the module was computed (as in Fig. 1A). Blue bars
demarcate collections of modules downstream of each 50% epiboly (5.3 hpf) gene module used in
Fig. 4B. (C) Gene expression cascades during specification of the prechordal plate and notochord.
Expression is displayed as a moving-window average in pseudotime (along the x axis), scaled to the
maximum observed expression. Selected genes are labeled along the y axis. Genes are annotated with
whether they were identified as a differentially expressed gene, as a top ranking member of a
differentially expressed connected gene module, or both. Cascades for all trajectories (with all genes
labeled) are presented in fig. S5.


funnels them cleanly and irreversibly into one of 
multiple downstream branches. For example, at 
the axial mesoderm branch point, most cells fell 
along the classic bifurcation from progenitor into 
notochord or prechordal plate fates (15, 29), with 
waves of gene expression corresponding to their 
specification and differentiation status (Fig. 6A, 
highlighted). However, ~5.4% were intermediate 
cells that expressed both notochord and pre- 
chordal plate markers (Fig. 6, A and B). The 
intermediate cells and the completely bifurcated 
axial mesoderm cells with similar pseudotimes 
came from embryos at the same developmental 


stage (Fig. 6C); moreover, they no longer ex- 
pressed genes characteristic of the progenitors 
(such as nanog and mex3b). These observations 
eliminated models in which intermediate cells 
retained their progenitor state and delayed spec- 
ification. Instead, we noticed that these cells 
expressed early markers of both programs (gsc, 
frzb, ta, and noto) but later markers of only the
notochord program (ntd5 and shha). This observa- 
tion raised two possible models: (i) Cells initially 
express both programs, then shut off the pre- 
chordal plate program, and produce only noto- 
chord markers later (“dual-specification” model); 
Fig. 4. Molecular specification maps relate cell position at 50% epiboly to cell fate at
six-somite. (A) Visitation by random walks from given tip(s) (as proportion of visitation from all
tips) and the spatial location of visited 50% epiboly cells (ventral side to the left). The six tip groups
are marked in Fig. 3A. (B) Spatial expression of 50% epiboly gene modules. Expression of connected
gene modules is plotted on the force-directed layout to highlight populations that will emerge
from the 50% epiboly module's expression domain. The six groups of connected gene modules are
marked in Fig. 3B.


or (ii) cells specified as notochord first express
early and late notochord markers and then change
their specification by shutting off notochord ex-
pression and initiating early prechordal plate
marker expression (“trans-specification” model).

To distinguish between these two models, we
investigated their properties and location using
fluorescent RNA in situ hybridization. At 75%
epiboly, some cells located in the border region
between the two tissues coexpressed an early
prechordal plate marker (gsc) with either an
early (ta/ntl) or late (ntd5) notochord marker
(Fig. 6, D to F, and fig. S11). The existence of these
intermediate cells in situ confirms that they are
not an artifact of scRNA-seq (such as cell doublets),
and their defined spatial localization near the
border of the two tissues suggests that they are
not merely biological or technical noise. Instead,
the location of coexpressing cells raises the pos-
sibility that the boundary between the notochord
and prechordal plate territories is refined during
gastrulation, after the two populations become
transcriptionally distinct. Many of the coexpress-
ing cells exhibited bright nuclear foci that denoted
sites of active transcription of the probed genes.
Most of the cells with nuclear transcription foci
for a single gene exhibited transcription of the
early prechordal plate marker, gsc. By contrast,


cells with active transcription of the notochord 
markers ta/ntl or ntd5 were rare (Fig. 6, E and 
F). The active transcription of the prechordal 
plate program supports the trans-specification 
model—the intriguing possibility that some axial 
mesoderm cells move down the notochord spec- 
ification path but then trans-specify into pre- 
chordal plate cells. 


Discussion 


We describe a molecular specification tree of an 
early vertebrate embryo by exploring large-scale 
single-cell transcriptomic data with URD, a com- 
putational approach to reveal developmental 
trajectories in transcriptional space. Our study 
lays the foundation for multiple areas of future 
exploration. 

First, URD is a powerful tool to reveal the 
transcriptional trajectories of complex devel- 
opmental processes such as zebrafish embryo- 
genesis. Combining URD with connected gene 
module analysis uncovered the transcriptional 
cascades underlying fate specification. More- 
over, combining URD with Seurat enabled the 
anchoring of developmental trajectories to their 
spatial origins. We anticipate that similar aug- 
mentation of URD with information about lineage 
relationships (30, 31), chromatin dynamics (32),
and signaling will further deepen insights into
developmental processes. Additionally, some of 
URD’s limitations could be addressed by future 
enhancements. For instance, branching events 
driven by single genes (such as sox32 in the endoderm)
(33, 34) are not captured with URD until
additional transcriptional differences arise, which 
could potentially be improved with more aggres- 
sive, iterative branch calling. Also, improvements 
to the throughput and quality of scRNA-seq may 
drive improved performance from URD: More 
closely spaced time points could enable detection 
of rapidly changing fate decisions and could re- 
veal sequential bifurcations at branch points cur- 
rently thought to enter multiple trajectories; 
larger numbers of cells could enable reconstruc- 
tion of rare, transcriptionally indistinct popula- 
tions (such as the floor plate and hypochord); 
and improved sequencing depth could enable 
detection of important but lowly expressed regu- 
lators that are not found in the current data (such 
as npas4l/cloche). In the future, URD could be 
used to analyze additional systems, such as in 
vitro differentiation, tissue regeneration, cancer, 
and disease progression. 

Second, the scRNA-seq data and developmen- 
tal tree provide a rich resource for future studies 
of zebrafish embryogenesis. The presented data 
describe embryonic gene expression with unparal- 
leled temporal and cellular resolution, and the 
reconstructed tree thus provides an atlas of the 
expression pattern and dynamics for nearly all 
genes. This allows inspection of gene coexpres- 
sion with much greater ease than with multicolor 
in situ hybridization, reveals markers for cell types 
of interest, and associates uncharacterized genes 
with particular cell types. Moreover, the data re- 
veal the progressive nature of cell fate specifica- 
tion and suggest potentially redundant regulators 
of developmental decisions with overlapping spa- 
tial and temporal expression, which would be 
missed in forward genetic screens. 

Third, this work begins to illustrate the tra- 
jectories, canalization, and potential plasticity of 
fate specification. With respect to trajectories, our 
analysis suggests that not all cell fate decisions 
are binary; multiple trajectories can arise simul- 
taneously from a pool of equipotent progenitors. 
For example, at the molecular level, the earliest 
specification of blastomeres separates the axial 
mesoderm, nonaxial mesendoderm, and ectoderm, 
rather than just separating the germ layers. This 
conclusion is compatible with the observation 
that the axial mesoderm progenitors reside at 
the dorsal blastula margin, where maternal fac- 
tors that specify the Mangold-Spemann organizer 
simultaneously affect the fate of those progenitors 
(15). Concerning canalization, the analysis of a 
Nodal signaling mutant reveals that even on the 
whole-transcriptomic level, mutant cells adopt a 
subset of wild-type transcriptomic states, but not 
new ones, demonstrating the extensive canaliza- 
tion during embryogenesis even after abnormal 
developmental signaling. With respect to plasticity, 
the identification of intermediate cells at the axial 
mesoderm branch point demonstrates hybrid de- 
velopmental states. In situ hybridization experiments 
suggest that these cells trans-specify from one
cell type (notochord) to another (prechordal plate), 
and their border zone spatial localization suggests 
refinement of the boundary between these two 
regions. These results support an alternative view 


Fig. 5. Characterization of Nodal signaling 
mutant with scRNA-seq and the developmental 
specification tree. (A) Spatial assignment of 
wild-type and MZoep transcriptomes according to 

a wild-type landmark map indicates an absence 

of wild-type marginal fates in MZoep (ventral, left). 
Shown at random are 311 wild-type transcriptomes 
(to match MZoep cell number). (B) (Top) wild-type
expression domain of spatially restricted gene mod-
ules identified in Smart-seq data (ventral to left).
(Bottom) Violin plot of the maximum-scaled gene
module levels in wild-type and MZoep mutant cells.
The marginal dorsal, dorsal, and marginal gene
modules are absent or strongly reduced in MZoep.
(C) Expression of gene modules connected to those
missing in MZoep (marginal dorsal, dorsal, and 
marginal; red) and connected to those remaining in 
MZoep (blue). (D) Hierarchical clustering of wild-type 
and MZoep mutant transcriptomes, based on the 
scaled expression of gene modules. Number of 
clusters is determined by the Davies-Bouldin index. 
Genotype is indicated beneath the heatmap (wild 
type, green; MZoep, red). Clusters 3 and 8 contain 
only wild-type cells. All other clusters contain a 
mixture of wild-type and MZoep cells. This clustering 
analysis was sufficiently sensitive to detect computa- 
tionally simulated altered states (fig. S10). 


Fig. 6. Hybrid state of cells in the axial mesoderm.
(A) Branch point plot, showing pseudotime
(y axis) and random walk visitation preference
from the notochord (N, left) and prechordal plate
(P, right) tips (x axis), defined as the difference
in visitation from the two tips divided by the sum of
visitation from the two tips. Direct trajectories to
notochord (green) and prechordal plate (pink) are
highlighted, and intermediate cells are circled.
(B) Gene expression of notochord markers (top row)
and prechordal plate markers (bottom row) at the axial
mesoderm branch point. Intermediate cells express
early (ta/ntl and noto) and late (ntd5 and shha)
notochord markers but only early prechordal plate
markers (gsc and frzb). (C) Cells at the branch
point, colored by developmental stage. Intermediate
cells have the same developmental stage as that
of fully bifurcated cells with similar pseudotimes.
(D) Illustration of the prechordal plate (P) and
notochord (N) in the 75% epiboly embryo.
(E and F) Double fluorescent in situ expression of the
early prechordal plate marker gsc (red) and either
the early notochord marker ta/ntl [(E), green]
or the late notochord marker ntd5 [(F), green] at
75% epiboly (8 hpf). Most cells contain only
prechordal plate marker mRNA (for example, “1”) or
notochord marker mRNA (for example, “5”). Cells
with both prechordal plate and notochord marker
mRNA are observed at the boundary of the two
tissues, with red nuclear transcription foci, indicating


of developmental fate choice to the common 
interpretation of Waddington’s model: Down- 
stream of some branch points, some cells still 
transition across the “ridges” between different 
lineages even after they are canalized into well- 


defined transcriptional states. This suggests a 
developmental plasticity that has been observed 
after perturbation (1-3, 35) but has not been well
established in normal developmental processes. 
We propose that in a continuous morphogen 
active transcription of gsc (for example, “2” to “4”) (fig. S11). Cells that contained both ntd5 and gsc mRNA were identified and scored for their nuclear transcription
foci, which indicate active transcription (Supplementary materials, materials and methods); 56% had one or more observable nuclear transcription dots, of which 
80% showed only active gsc transcription, 7% showed both active gsc and active ntd5 transcription, and 13% showed active ntd5 transcription alone. 
gradient that resolves into discrete states, some
cells will receive an amount of signal that is on the 
cusp between the two states; these cells would 
end up at the boundary between the two tissues. 
It is conceivable that continued signaling could 
provide a “push” over a shallow ridge, such as 
continued Nodal signaling in the axial meso- 
derm (22) or continued retinoic acid signaling 
in the hindbrain (36, 37). Future studies com- 
bining lineage tracing and in vivo imaging of 
gene expression will be needed to directly ob- 
serve such transitions. 


Collectively, our approach provides a frame-
work with which to reconstruct the specification
trajectories of many developmental systems with- 
out the need for prior knowledge of gene ex- 
pression patterns, fate maps, or lineage trajectories. 
The generation of developmental trees for dif- 
ferent species will enable comparative studies, 
and developmental statistics generated from such 
comparisons will help reveal conserved pathways 
and species-specific idiosyncrasies.
SINGLE-CELL ANALYSIS


The dynamics of gene expression 
in vertebrate embryogenesis 
at single-cell resolution 


James A. Briggs, Caleb Weinreb, Daniel E. Wagner, Sean Megason, Leonid Peshkin, 


Marc W. Kirschner,* Allon M. Klein* 


INTRODUCTION: Metazoan development 
represents a big jump in complexity com- 
pared with unicellular life in two aspects: 
cell-type differentiation and cell spatial or- 
ganization. In vertebrate embryos, many dis- 
tinct cell types appear within just a single day 
of life after fertilization. Studying the de- 
velopmental dynamics of all embryonic cell 
types is complicated by factors such as the 
speed of early development, complex cellular 
spatial organization, and scarcity of raw ma- 
terial for conventional analysis. Genetics and 
experimental embryology have clarified major 
transcription factors and secreted signaling 
molecules involved in the specification of early 
lineages. However, development involves par- 
allel alterations in many cellular circuits, not 
just a few well-described factors. 


RATIONALE: We recently developed a 
microfluidics-based single-cell RNA sequenc- 
ing method capable of efficiently profiling 
tens of thousands of individual transcrip- 
tomes. Building on earlier studies that showed 
how single-cell transcriptomics can reveal cell 
states within complex tissues, we reasoned that 
a series of such measurements from embryos, 
if collected with sufficient time resolution, 
could allow reconstruction of developmen- 
tal cell-state hierarchies. We focused on the 
western claw-toed frog, Xenopus tropicalis, 
which serves as one of the best-studied mod- 
el systems of early vertebrate development. 
We profiled these embryos from just before 
the onset of zygotic transcription up to a point 
at which dozens of distinct cell types have 
formed encompassing progenitors of most 
major organs. To establish aspects of devel- 
opment general to vertebrates, we addition- 
ally incorporated data from the copublished 
paper by Wagner et al. on zebrafish embryos, 
which separated from frogs about 400 mil- 
lion years ago. 


RESULTS: We profiled 136,966 single-cell 
transcriptomes over the first day of life of 
Xenopus tropicalis. Our analysis classifies 
259 gene expression clusters across 10 time
points, which belong to 69 annotated em- 
bryonic cell types and capture further sub- 
structure. Using a computational approach to 
link cell states between time points, a result- 
ing cell-state graph agrees well with previous 
lineage-tracing studies and shows that devel- 
opmental fate choices can be well approxi- 
mated by a treelike model. Many cell states 

are detected considerably earlier than previously
understood, thus revealing the earliest events in their
differentiation. The data lend clarity to numerous
specific developmental processes, such as the
developmental origin of the vertebrate neural crest.
Through an evolutionary comparison with zebrafish, we
identified diverging features of developmental
dynamics, including many genes showing cell-type
specificity in one organism but not in another. Yet, we
also identified conserved patterns in the reuse of
transcription factors across lineages and in
multilineage priming at fate branch points. The
resulting resource is available in an interactive
online browser that allows in silico exploration of
any gene in any cell state (tinyurl.com/scXen2018).

Read the full article at http://dx.doi.org/10.1126/science.aar5780


CONCLUSION: The approaches and results 
presented here, along with the copublished 
paper by Wagner et al., establish the first steps 
toward a data-driven dissection of develop- 
mental dynamics at the scale of entire orga- 
nisms. They provide a useful, annotated resource 
for developmental biologists, comprehensive- 
ly tracking differentiation programs as they 
unfold on a high-dimensional gene expression 
landscape. Although demonstrated on model 
organisms, the same approaches could be trans- 
formative to the study of nonmodel organisms 
by allowing rapid and quantitative descrip- 
tion of differentiation processes across the tree 
of life, opening up a new front in evolutionary 
biology. 


[Figure panels: Whole embryos at ten time points; Cell state linkage over time; Developmental origin of neural crest]


Single-cell analysis of whole developing 
vertebrate embryos. Xenopus embryos at
10 time points over the first day of life were
dissociated, barcoded, and sequenced, yielding 
136,966 single-cell transcriptomes. These data 
were clustered and connected over time to 
reveal a complete view of transcriptional 
changes in each embryonic lineage and clarify 
numerous features of early development. hpf, 
hours postfertilization. 
SINGLE-CELL ANALYSIS


The dynamics of gene expression 
in vertebrate embryogenesis 
at single-cell resolution 


James A. Briggs, Caleb Weinreb, Daniel E. Wagner, Sean Megason, Leonid Peshkin, 


Marc W. Kirschner,* Allon M. Klein* 


Time series of single-cell transcriptome measurements can reveal dynamic features of cell
differentiation pathways. From measurements of whole frog embryos spanning zygotic
genome activation through early organogenesis, we derived a detailed catalog of cell states
in vertebrate development and a map of differentiation across all lineages over time. The
inferred map recapitulates most if not all developmental relationships and associates new
regulators and marker genes with each cell state. We find that many embryonic cell states
appear earlier than previously appreciated. We also assess conflicting models of neural crest
development. Incorporating a matched time series of zebrafish development from a companion
paper, we reveal conserved and divergent features of vertebrate early developmental gene
expression programs.


Metazoan development represents a big
jump in complexity compared with unicellular
life in two aspects: cell type differentiation
and cell spatial organization.
The fertilized egg was once itself thought 
to be very complex, concealing the spatial and 
compositional differentiation of the adult. How- 
ever, it is now clear that the egg is a rather 
simple cell, and the complexity of the embryo 
and adult arises by bootstrapping: repeatedly 
exploiting spatial asymmetries to build the com- 
plex embryo by stages. In vertebrate embryos, 
as well as in other phyla, a provisional body plan 
organization arises before overt cell-type-specific 
differentiation, and this provides a scaffold for 
the emerging complexity. 

The process of increasing spatial and cell-type 
heterogeneity has been difficult to follow com- 
prehensively at the molecular level. Many in- 
sights have emerged from genetic studies and 
experimental embryology, but those analyses 
generally focus on a few transcription factors 
and/or secreted signaling molecules. Instead, 
development involves alterations in many dif- 
ferent intracellular and intercellular circuits. 
Recent technology advances have enabled the 
mapping of mRNA content in single cells over 
time, which could in principle systematically 
reveal differentiation pathways across entire 
developing embryos (Fig. 1A). 

Two years ago, we developed a versatile single- 
cell transcriptomic method based on droplet 
microfluidics that accurately measures genome-
wide RNA levels in individual cells at high
throughput (1). We applied this method to analyze
all of the cells in the embryos of the Western claw- 
toed frog, Xenopus tropicalis, over the first day 
of life after fertilization. Along with its larger 
relative X. laevis, the frog embryo serves as one 
of the best-studied model systems of early ver- 
tebrate development. The egg undergoes 12 rapid 
cleavage divisions, during which time zygotic 
transcription is suppressed. The resulting 4000 
cells are initially pluripotent. However, within 
4 hours of activating zygotic transcription at the 
midblastula transition (2, 3), embryos go through 
gastrulation to establish the major germ layers, 
followed by fate commitment and progressive dif- 
ferentiation (4). We profiled embryos before the 
onset of zygotic transcription, through fate com- 
mitment, and into the early tailbud stage—a 
point at which dozens of distinct cell types have 
formed encompassing the early progenitors of 
most major organs. In the course of this work, 
we developed general computational tools that 
can relate cell states over time, providing a global 
view of gene expression diversification in the early 
embryo. Finally, we compared the results of this 
analysis to our recently completed time series of 
the developing zebrafish, Danio rerio. Through 
systematic analyses across lineages, time, and 
species, we note generalities and differences in 
embryonic cell differentiation in the two verte- 
brate clades, which separated about 400 million 
years ago.


Single-cell RNA sequencing of whole 
developing embryos 


Xenopus blastomeres are large and fragile, with 
cell diameters up to 50 µm (for X. tropicalis,
compared with 100 µm for X. laevis) at the onset
of zygotic transcription (stage 8.5). To preserve 
the integrity of cells through tissue dissocia- 
tion and microfluidic handling, we optimized 
the inDrop platform to accommodate large cells. 
We also formulated a dissociation buffer that 
uses alkaline pH to promote dissociation and 
handling procedures that minimize shear stresses 
placed on cells. The new protocol enables rapid 
and complete dissociation of whole embryos to 
single cells within 25 min, with minimal mech- 
anical agitation and cell handling, preservation 
of >95% cell viability, and background single-cell 
RNA sequencing (scRNA-seq) signal (indicative 
of cell lysis) of <1 to 4% of total cellular RNA 
from stage 10 onward (fig. S1). 

The samples for scRNA-seq were taken at 
10 time points from zygotic genome activation 
[stage 8, 5 hours postfertilization (hpf)] through 
early organogenesis (stage 22, 22 hpf), profiling a 
total of 136,966 single cells in two replicate ex- 
periments (table S1 and movies S1 to S3). The 
first replicate consisted of 42 k cells sequenced 
to a depth of 5.4 k unique molecular identifiers 
(UMIs) per cell on average (99% of genes from 
bulk RNA-seq observed in pools of >100 cells), 
whereas the second replicate of 95 k cells was 
sequenced to an average depth of 1.4 k UMIs 
per cell. The number of measured cells allows 
detection of rare subpopulations such as germ 
cells. Overall, we expect to observe transcriptional 
states as rare as 0.1% of the embryo represented 
by at least 10 cells with 95% confidence (fig. S2).
There may be rare or transient subpopulations 
that are missed given the total number of cells, 
time sampling, and the depth of sequencing 
we used. 
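
As a rough check on the sampling claim above, the following sketch asks how many cells must be profiled at one time point for a state making up 0.1% of the embryo to be represented by at least 10 cells with 95% probability. It assumes simple binomial sampling, in which each profiled cell independently belongs to the rare state with probability equal to its frequency. That is an illustrative assumption and not necessarily the calculation behind fig. S2, and the function names are hypothetical.

```python
import math


def prob_at_least(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), computed as 1 - P(X <= k - 1)."""
    if n < k:
        return 0.0  # cannot observe k cells of a state among fewer than k cells
    cdf = 0.0
    for i in range(k):
        # Log-space binomial pmf avoids overflow in the binomial coefficient.
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                   + i * math.log(p) + (n - i) * math.log(1.0 - p))
        cdf += math.exp(log_pmf)
    return 1.0 - cdf


def min_cells_needed(freq, k=10, confidence=0.95):
    """Smallest n such that a state of frequency `freq` yields >= k cells
    with probability >= `confidence` (assumed binomial sampling model)."""
    lo, hi = k, k
    # Double the upper bound until it satisfies the confidence requirement.
    while prob_at_least(k, hi, freq) < confidence:
        lo, hi = hi, hi * 2
    # Binary search for the smallest satisfying n (the probability is monotone in n).
    while lo < hi:
        mid = (lo + hi) // 2
        if prob_at_least(k, mid, freq) >= confidence:
            hi = mid
        else:
            lo = mid + 1
    return lo


n_needed = min_cells_needed(0.001, k=10, confidence=0.95)
print(n_needed)
```

Under these assumptions the requirement works out to roughly 16,000 cells per sample, which gives a sense of why the total cell number, and not only the sequencing depth, limits detection of rare states.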

Low-dimensional visualization of the scRNA- 
seq time series shows a pattern of increasing 
complexity over time, with cells fragmenting 
from a continuum of gene expression states in 
the early gastrula into distinct clusters at later 
time points (Fig. 1B and figs. S3 and S4). To relate 
the scRNA-seq data to known embryonic cell 
types, we first classified gene expression states 
at each time point by a hierarchical clustering 
approach (Fig. 1B and fig. S3). These cluster
assignments were robust to different clustering 
algorithms (figs. S5 and S6). We then annotated 
each cluster by matching cluster-specific genes to 
>2000 in situ measurements of marker genes for 
embryonic cell types documented on the Xenopus 
Bioinformatics database (Xenbase) (5) (see Data 
S1). We additionally shared our annotations with
experts in the Xenopus research community, con- 
vened in part for that purpose at a recent Jamboree 
meeting, who helped to standardize our cell- 
state nomenclature and choice of annotations. 
In total, we identified 87 distinct cell-type an- 
notations, many of which persisted in clusters 
across multiple time points (average of 2.9 time 
points each) (Fig. 2C). Of these 87 states, 69 cor- 
responded unambiguously to a specific Xenbase 
anatomical term, whereas the remaining 18 states 
correspond to finer structure (i.e., cell subtypes) 
or unidentified cell states within these anatom- 
ically defined tissues. Further substructure 
may be resolved by manual inspection of marker
genes employing previous expert knowledge. 


Reconstructing developmental 
cell-state transitions 


Fate mapping studies have previously estab- 
lished a set of spatial lineage relationships in the 
Xenopus embryo; these generally support the 
notion of a hierarchical pattern of differentiation, 
albeit with some exceptions (6). We investigated 
the extent to which these known hierarchical 
lineage relationships are reflected in our map 
of expression states. A close match was not as- 
sured because there are several reasons that 
cell lineage hierarchies might not be reflected 
in molecular expression. First, gene expression 
states are defined not only by tissue but by 
spatial position and cell cycle state, which cross 
lineage boundaries. Second, asymmetric cell di- 
visions could lead to discontinuities in cell state 
as a mother cell partitions into daughter cells 
with distinct molecular compositions. Third, al- 
though the lineage history of individual cells 
consists entirely of discrete bifurcation (mitosis) 
events, cell states could be continuous or show 
complex branching structures or even loops. Yet, 
counteracting this, in the lineage data of the 
early embryo, new gene expression generally 
arises in single contiguous domains. 

To ask whether the single-cell gene expression 
states conform to a branching pattern and, if so, 
whether the pattern overlaps with the previously 
deduced lineage patterns, we devised a simple 
algorithm that connects each cell to its most 
likely ancestors (i.e., nearest neighbors) in the 
previous time point and uses the consensus of 
these connections to assign an ancestor to each 
cell state (Fig. 2A). By applying this algorithm 
to all 259 cell states, composed of 136,966 Xenopus 
Fig. 1. Dissection of early Xenopus tropicalis development by
scRNA-seq. (A) Single-cell transcriptomes represent points in 

a high-dimensional gene expression space. By collecting single-cell 
transcriptomes over time of embryo development, it is possible to infer a 
continuum gene expression manifold connecting cell states across all 
lineages. (B) Summary of scRNA-seq developmental time course, 
cells spread over 10 time points (Fig. 2B), we
generated a detailed map of putative cell-state 
transitions during early development (Fig. 2C 
and fig. S7). The map recapitulates known ex-
pression domains of master regulators within
each germ layer over time (fig. S8). Notably, only 
17% of all votes fell outside of consensus ancestor 
clusters, showing that cell states over develop- 
ment can generally be approximated by a treelike 
structure (Fig. 2D and fig. S7). The majority of
votes cast outside of consensus ancestor states 
occurred at early time points, due to their con- 
tinuum structure. The branching nature of cell 
states can also be appreciated by inspecting 
individual lineages at single-cell resolution (Fig. 2, 
E and F). The inferred cell-state tree structure 
was consistent with ancestor assignments gen- 
erated using an alternative-state graph coarse- 
graining algorithm reported in our companion 
paper (7) (fig. S9). 
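
The voting idea described above can be sketched on toy data as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes Euclidean distance in the input coordinates, a fixed number of neighbors `k`, and one vote per neighbor; the function name `assign_ancestors` and the toy arrays are hypothetical.

```python
from collections import Counter

import numpy as np


def assign_ancestors(X_prev, labels_prev, X_next, labels_next, k=3):
    """Assign each later-time-point state a consensus ancestor state.

    Every cell at the later time point votes for the states of its k nearest
    neighbors at the earlier time point; the state with the most votes wins.
    Returns the ancestor mapping and the fraction of votes that fell outside
    the consensus ancestors.
    """
    # Dense pairwise squared distances: acceptable for a toy example only.
    d2 = ((X_next[:, None, :] - X_prev[None, :, :]) ** 2).sum(axis=2)
    nn = np.argsort(d2, axis=1)[:, :k]   # indices of the k nearest earlier cells
    votes = labels_prev[nn]              # shape (n_next_cells, k)

    ancestors, outside, total = {}, 0, 0
    for state in np.unique(labels_next).tolist():
        state_votes = votes[labels_next == state].ravel().tolist()
        winner, n_win = Counter(state_votes).most_common(1)[0]
        ancestors[state] = winner
        outside += len(state_votes) - n_win
        total += len(state_votes)
    return ancestors, outside / total


# Toy data: two well-separated states at the earlier time point.
X_prev = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
labels_prev = np.array(["A", "A", "A", "B", "B", "B"])
X_next = np.array([[0.5, 0.5], [1, 1], [9, 9], [10.5, 10.5]], dtype=float)
labels_next = np.array(["A1", "A1", "B1", "B1"])

ancestors, frac_outside = assign_ancestors(X_prev, labels_prev, X_next, labels_next)
print(ancestors, frac_outside)
```

The returned fraction plays the role of the share of votes falling outside consensus ancestor clusters; a value near zero indicates that a treelike approximation fits the toy data well.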

The ancestor assignments largely agree with 
the known lineage relationships. Of all cross- 
time point edges inferred by the algorithm (Fig. 
2C), we could confirm 234 (91%) by comparison 
to Xenbase anatomy ontology (XAO) (8, 9) and 
the literature (see Data S2). Of the remainder, 22 
could not be ruled in or out, and only 2 (1%) were 
identifiably incorrect. The two errors, which oc-
curred in somites and erythroid lineages, re-
flected a specific limitation of the tree-building
approach: In both lineages, mature states ini- 
tially arise from progenitors but then form a 
parallel branch rather than continuously arising 
from progenitors anew at each time point. Vis- 
ualization of the raw scRNA-seq data reveals the 
underlying asynchronous differentiation process 
(fig. S10). In addition to these errors, the cell- 
state tree does not consistently resolve spatial 
localizations—e.g., we do not resolve individual 


somites. The tree-building approach could addi- 
tionally generate errors when applied to other 
data sets lacking the dense time-sampling car- 
ried out here. When inferring cell-state transi- 
tions from scRNA-seq snapshots separated by 
large gaps in time, intermediate progenitor states 
may be overlooked. Overall, these results support 
a growing body of single-cell bioinformatics 
methods that seek to infer developmental cell 
trajectories on the basis of continuity (10, 11) and
distance minimization (12) in gene expression 
space, but also illuminate where this principle 
can be misleading. 

Among the developmental relationships in- 
ferred on the tree, there is also a question about 
the timing at which known cell types first ap- 
pear. Of the 69 cell types with an unambiguous 
match to Xenbase, 60 appeared in our data at 
the developmental stage indicated in XAO 
(8, 9) or earlier. Several appeared much earlier 
than previously recognized (Fig. 3A), includ- 
ing an endothelial/hemangioblast progenitor, 
which appeared from the dorsal lateral plate 
region at stage 18, as compared with stages 26 
and 31, respectively, as previously thought (13, 14) 
(Fig. 3B). This afforded an opportunity to ex- 
plore the earliest transcriptional events as- 
sociated with the specification of these fates 
(Fig. 3C). The tail bud and multiple epidermal 
cell types also appeared much earlier than indi- 
cated by XAO (Fig. 3A and Data S2) and revealed 
previously unknown early transcriptional dynamics. 

Overall, the cell-state tree provides a resource 
for probing gene expression in the early Xenopus 
embryo, incorporating annotations of cell states, 
linking related states across time, and discover- 
ing the earliest transcriptional record of the 
specification of each cell type. It is now possible 
to identify genes that are differentially expressed 


[Fig. 1B panels: tSNE plots by stage (panel cell counts shown include n = 10,795 and n = 16,642); axis: tSNE dimension 1. Legend: Pluripotent blastula; Germline; Non-neural ectoderm; Placodal; Specialized epidermis; Neural crest; Neural; Marginal zone; Dorsal mesoderm; Vent/lat/int. mesoderm; Endoderm.]

1 June 2018

including 136,966 single-cell transcriptomes sampled over 10 embryonic
stages (S8, S10 to S14, S16, S18, S20, and S22). T-Distributed
stochastic neighbor embedding (tSNE) plots show increasing cell
population structure over time. Colors indicate major tissues grouped
by germ layer. Further details on subclustering shown in figs. S3, S5, and
S6 and at tinyurl.com/scXen2018.
between bifurcating cell states, to assess gene
expression changes along the developmental 
history of each lineage, and to automatically 
identify specific marker genes for each cell state 
(fig. S11). We have tabulated genes differential- 
ly expressed at every embryonic cell-fate choice 
(see Data S3), indicating potential fate regu- 
lators, as well as marker genes for every cell 
state (see Data S4). The complete data set is 
available through an interactive online browser 
(tinyurl.com/scXen2018), which supports vi- 
sualization of gene expression across the cell- 
state tree, identification of enriched genes in 
specific states, differential expression analysis 
between cell states, coexpression analysis of 
gene pairs, and visualization of the dynamics 
of gene expression along particular cell-state 
tree branches. Interactive plots of single cells 
from each stage are also available through this 
browser. 


Divergence of developmental 
gene expression between frog 
(amphibians-anurans) and fish (teleosts) 


There are numerous cladograms of gene se- 
quence that confirm the basic paleontological 
record of vertebrates, reflected in clear differ- 
ences as well as similarities between Xenopus 
and zebrafish in their sequenced genomes. But 
whenever there is a change in anatomy, cell 
type, and physiology there must be an underly- 
ing alteration in the developmental program. 
Time-series single-cell gene expression mea- 
surements offer an opportunity to look deeply 
into these developmental gene expression pro- 
grams and give us clues about the most conse- 
quential evolutionary changes across species. We 
first asked whether the same (i.e., orthologous) 
genes are conserved in developing tissues in frog 
and fish using data from our companion paper 
on zebrafish development (7), which spans a de- 
velopmental period similar to that of the frog 
dataset. We processed the zebrafish time series 
through the same tree-building algorithm over 
annotated cell states. We manually aligned cell 
states between species (Fig. 4A, red shading) on 
the basis of tissue name, marker gene expres- 
sion, developmental stage, and lineage relation- 
ships. In some cases, nomenclature of homologous 
tissues differed slightly between species. Thirty 
tissues could be matched with high confidence, 
jointly covering 66 of 87 Xenopus states (79% of 
cells) and 83 of 122 zebrafish states (83% of 
cells). The cell-state tree alignment indicated 
broad conservation of lineage topologies between 
species, although the observed abundances of 
cell types differed markedly between species, 
and the proportion of matched states and cells 
decreased over time as species-specific features 
accumulated—such as specialized epidermal cell 
types in both species (fig. S12). Notably, the 
zebrafish neural ectoderm grew to 60% of se- 
quenced cells in the embryo at 24 hpf, com- 
pared with 31% in the stage 22 frog (fig. S13). 
Differences in cell survival rates during InDrops 
processing in either species could affect estimates 
of cell-state abundances. 
The alignment of the cell-state trees also
highlighted significant changes in developmental 
patterning between the two species, which could 
further be understood by deeper inspection of 
the underlying single-cell data. In the epidermis, 
for example, the two species show a radiation 
of common and distinct cell types from gata2- 
expressing non-neural ectoderm (Fig. 4B and 
fig. S14). Trajectories of differentiating ionocytes 
can be seen in both species, whereas ciliated cells 
expressing foxj1, goblet cells expressing itln1, and
small secretory cells expressing met (fig. S14), 
each of which is an important component of the 
Xenopus epidermis, are absent in zebrafish. These 
different epidermal cell types likely reflect the 
demands of development in different environ- 
ments, potentially requiring an early immune 
barrier in the frog but not the fish. Rearrange- 


ments of lineage topology can also be understood: 
Xenopus and zebrafish both produce specialized 
hatching glands (HG) that secrete shared hatch- 
ing enzymes and that are specified by the conserved 
transcription factor klf17, seen by scRNA-seq
(fig. S15). Single-cell trajectories link the HG to 
non-neural ectoderm in Xenopus but to the or- 
ganizer mesoderm in zebrafish (Fig. 4A and fig. 
S15), reflecting a known difference in HG germ
layer origin between amphibians and teleosts (15).

Other more subtle changes in lineage topology 
involved the addition or loss of specialized pro- 
genitor cell states within a partially shared develop- 
mental history. For example, xanthoblasts appear 
at 18 hpf in zebrafish from an early neural crest 
population but do not appear until stage 46 in 
Xenopus (16), after the shared ancestral neural 
crest population has progressed through multiple 
Fig. 2. Inference of developmental cell-state transitions from gene expression similarity.
(A) Schematic of the mapping algorithm used to make similarity connections between clusters
across time. (B) Global visualization of single cells profiled in the Xenopus developmental
time course using a k-nearest neighbor graph (7). (C) Cell-state tree showing all inferred
developmental transitions. Generated by applying the mapping algorithm in (A). (D) Representative
cell voting outcomes between time points, generated during state tree construction. (E) Single-cell
visualization of a representative subtree, showing lateral and intermediate mesoderm fates.
Lines indicate corresponding topology of the cell-state tree. (F) Marker gene expression associated
with the formation of each intermediate mesoderm cell state.
additional specialized neural crest intermediate
states. Similarly, myeloid cells appear in Xenopus 
at stage 14 from early involuted ventral meso- 
derm, whereas they (macrophages and leuko- 
cytes) appear only at 18 to 24 hpf in zebrafish, 
after the ventral mesoderm has differentiated 
through specialized cardiovasculature and lateral 
plate intermediate states. Terminal differentia- 
tion of myeloid cells in both cases involves ac- 
tivation of several conserved master regulators, 
including cybb, cyba, spib, and cebpa. These ob- 
servations support a degree of independence 
between the differentiation path characterizing a 
lineage and the activation of particular conserved 
terminal cell phenotypes (17). Variation in dif- 
ferentiation paths can reflect not only drift or 
flexibility in developmental programs but also 
selected features of embryonic cells, including 
roles in interacting with and instructing other 
differentiation and morphogenetic mechanisms. 

To obtain a systematic view of similarities and 
differences in developmental gene expression 
across species, we used the aligned cell-state trees 
to investigate whether orthologous genes show 
correlation in expression values across matched 
embryonic tissues (Fig. 4C). Focusing on the sub- 
set of genes that were robustly expressed and 
dynamically varying in both species, we observed 
that orthologous genes correlated in their ex- 
pression between species (Fig. 4C), seemingly 
supporting the notion that cell-state gene ex- 
pression is largely conserved. However, only a 
minority (30%) of orthologous pairs were cor- 
related with high confidence [5% false discovery 
rate (FDR)] compared with random pairs. For 
example, in contrast to the well-conserved (r = 
0.9) neuroectodermal marker sox2, we identified 
many examples of poorly correlated genes such 
as the transcription factor gata5 (r = 0.1) (Fig. 4C). 
Despite being a conserved endoderm and cardiac 
master regulator, gata5 is also expressed in the 
erythropoietic tissue only of Xenopus (18) and 
Fig. 3. scRNA-seq detects early transcriptional events during 


specification of embryonic cell states. (A) Time 


each cell state in the cell-state tree as compared with documented 
appearance times in the Xenopus anatomy ontology (XAO). Red/blue 
points are detected early/late in scRNA-seq as compared with XAO. Sixty 
of 69 states appear as early or earlier than documented. Error bars 
in the marginal zone only of zebrafish (19) at a 
distinct spatiotemporal location of endoderm 
specification compared with the vegetal expres- 
sion seen in Xenopus. Accordingly, we were 
surprised to find that the genes best marking 
cell states within one species did not generally 
perform well at marking the same cell states 
within the other (fig. S16). 
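
As a rough illustration of the cross-species comparison described above, the following sketch computes a Pearson correlation for each orthologous gene pair across matched cell states, together with a shuffled-pair null. The function names, array layout, and toy values are assumptions made for illustration and are not taken from the authors' pipeline.

```python
import numpy as np

def ortholog_correlations(expr_a, expr_b):
    """Pearson r per gene between two (genes x matched_states) matrices.

    Assumption: row i of expr_a and row i of expr_b form an orthologous
    pair, and column j in both matrices is the same aligned cell state.
    """
    a = expr_a - expr_a.mean(axis=1, keepdims=True)
    b = expr_b - expr_b.mean(axis=1, keepdims=True)
    denom = np.sqrt((a ** 2).sum(axis=1) * (b ** 2).sum(axis=1))
    # Genes with no variation in either species yield nan.
    with np.errstate(invalid="ignore", divide="ignore"):
        return (a * b).sum(axis=1) / denom

def shuffled_pair_null(expr_a, expr_b, seed=0):
    """Correlations for randomly re-paired genes, as a simple null."""
    rng = np.random.default_rng(seed)
    return ortholog_correlations(expr_a, expr_b[rng.permutation(len(expr_b))])

# Hypothetical toy data: 2 ortholog pairs measured in 4 matched states.
frog = np.array([[1.0, 2.0, 3.0, 4.0],
                 [4.0, 1.0, 3.0, 2.0]])
fish = np.array([[2.0, 4.0, 6.0, 8.0],
                 [1.0, 4.0, 2.0, 3.0]])
r = ortholog_correlations(frog, fish)  # approximately [1.0, -1.0]
null_r = shuffled_pair_null(frog, fish)
```

Comparing the distribution of `r` with that of `null_r` is one simple way to ask how many pairs exceed what random pairing would give; the significance threshold itself is left out of this sketch.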

We tried to understand what properties pre- 
dict whether a gene shows conserved expres- 
sion patterns across tissues across species—i.e., 
whether specific functional categories of genes 
are more likely to be conserved in expression dy- 
namics. Analysis of gene ontology annotations 
spanning diverse cellular processes revealed a 
striking enrichment of gene expression conserva- 
tion among transcription factors, identifying three 
statistically significant (P < 0.05; binomial-test, 
Bonferroni corrected) functional annotations that 
include “nucleus,” “regulation of transcription,” 
and “transcription factor activity” (Fig. 4D). By 
contrast, protein sequence conservation was un- 
correlated to gene expression conservation (r = 
0.01; P = 0.6). Orthologous genes with 20 to 
40% sequence identity had the same expression 
conservation as those with 95 to 100% sequence 
identity (average expression conservation of 0.41 
versus 0.38, P = 0.5; t test). A gene’s function is 
therefore more strongly predictive of its con- 
servation in developmental gene expression pro- 
grams across species than conservation of its 
protein sequence, decoupling two fundamental 
processes underlying evolutionary changes in 
embryonic development. 


Reuse of developmental 
transcription factors 


Given the close association and conservation of 
transcription factors with differentiation, and 
hence cell type, we asked two questions: (i) How 
are transcription factors deployed across cell 
types and across time during development? (ii) 




How, at the level of genes, do new cell iden- 
tities emerge at branch points in development? 
These questions can be examined here in dozens 
of cell states and in the unperturbed setting of 
the embryo, rather than in cell culture. 

Classical experiments have described differen- 
tiation in terms of the activation of single tran- 
scription factors, called master regulators, which 
established cell identity and after which other 
transcription factors progressively refined the 
phenotype (20) (Fig. 5A). Alternative views, how- 
ever, ascribe the initial specification of cell identity 
to interactions among multiple transcription 
factors (TFs), a “combinatorial code,” capable 
of defining a larger number of identities with 
different combinations from a small set of TFs 
(21, 22) (Fig. 5B). We asked which of these views 
best describes early development. This is not just 
of theoretical interest, because knowing which 
transcription factor(s) could specify a given cell 
type could aid in the formulation of combina- 
tions that could be used therapeutically (23). Our 
experiments identify some cases where candi- 
date TFs are activated just once in early develop- 
ment, after which they persist and may define 
a lineage, suggesting a master regulator model. But 
we also see TFs that are expressed more than once 
de novo in distinct and unrelated cell states, which 
may support a model of combinatorial deployment. 

To understand the prevalence of these modes 
of TF use, we scored the number of times a TF 
was expressed independently—i.e., in states that 
share a nonexpressing common ancestor. From 
this analysis, we found that the expression of half 
of developmentally variable TFs is initiated only 
once during early development (52% in frog, 54% 
in zebrafish) (Fig. 5C). The detected single-use TF 
induction events occurred in just 35% of an- 
notated cell states, seemingly inconsistent with 
the view that distinct master TFs define each 
fate. Reused TFs, by contrast, cover 58% of tissue 
annotations in a complex combinatorial code 


represent time interval of scRNA-seq experiment. (B and C) scRNA-seq 
reveals an early endothelial/hemangioblast progenitor that appears 

at stage 18 (red lineage), as compared with stage 26 for 
hemangioblasts and stage 31 for endothelial cells [XAO (12, 13)], 

with recognizable activation of the endothelial/hemangioblast gene 
expression program (C). 
(see Data S5). In the remaining 7% of cell states, a 
specific TF induction was not detected. Over 
time, the average fraction of transcription factors 
being reused in a second circumstance climbs, 
reaching ~90% of new TF expression initiation 
events by stage 20 (18.5 hpf) in Xenopus and by 
20 hpf in zebrafish (Fig. 5D). Together these re- 
sults underscore the importance of combinatorial 
TF reuse in developmental gene expression 
programs. 
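
The scoring of independent TF expression described above can be sketched as counting, on a cell-state tree, the states in which a TF is detected while the parent state lacks it. This is a simplified reading of the criterion, and the tree encoding, function name, and toy states below are hypothetical.

```python
def count_independent_inductions(parent, expressing):
    """Count states where a TF appears de novo on a cell-state tree.

    parent: dict mapping each state to its parent state (root maps to None).
    expressing: set of states in which the TF is detected.
    A state counts as an induction event when it expresses the TF and its
    parent does not (states with no recorded parent are treated as roots).
    """
    events = 0
    for state in expressing:
        p = parent.get(state)
        if p is None or p not in expressing:
            events += 1
    return events

# Hypothetical toy state tree.
parent = {
    "blastula": None,
    "ectoderm": "blastula",
    "mesoderm": "blastula",
    "otic placode": "ectoderm",
    "epidermis": "ectoderm",
    "pronephric mesenchyme": "mesoderm",
    "somite": "mesoderm",
}

# A TF switched on once in mesoderm and inherited by its descendants.
single_use = {"mesoderm", "pronephric mesenchyme", "somite"}
# A TF switched on separately in two unrelated branches.
reused = {"otic placode", "pronephric mesenchyme"}

n_single = count_independent_inductions(parent, single_use)  # 1
n_reused = count_independent_inductions(parent, reused)      # 2
```

Under this assumption, a TF with a count of one corresponds to the single-use case and a count above one to reuse.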
Fig. 4. Similarities and differences in developmental cell-state hierar- 
chies and gene expression between frog and fish. (A) Xenopus and 
zebrafish cell-state trees aligned by orthologous cell states (red shading). 
Gray/white stripes provide a visual guide. (B) Single-cell visualization of 
genes. (C) (Left) Ortholog genes across species have variable conservation of 
Consistent with the notion that combinatorial 
interactions could lead to complex gene expres- 
sion responses, the expression of the same TF in 
different tissues did not generally correlate with 
the same gene sets, as seen in the example of 
Pax8, which is independently expressed in the otic 
placode and pronephric mesenchyme (Fig. 5E). 
Some putative downstream targets of reused TFs 
could, however, be identified in more than one 
tissue, as in the case of foxj1, which is expressed 
in both the ciliated epidermis and the floor plate 
and orchestrates motile cilia formation. Many of 
the genes correlating with foxj1 transcript abun- 
dance within the floor plate also correlated with 
foxj1 in the epidermis and were highly enriched 
for ciliogenesis gene ontology (GO) terms (Fig. 5F). 
Interestingly, we found that TFs deployed just 
once over the observed time series were more 
conserved in their expression pattern across the 
two species compared with reused TFs (median 
cell-state-specific expression. Just 30% of self-similar orthologs are con- 
served at a 95% FDR compared with random gene pairs. (Right) Examples of 
highly (Sox2) and poorly (Gata5) correlated TFs across species. (D and E) 
Function, not sequence, predicts gene expression conservation. (D) Orthologs 
with highly conserved expression patterns across species are enriched in 
TF-associated GO terms. P values show Bonferroni-corrected binomial test 
results. (E) Protein sequence conservation is not correlated with gene 
expression conservation (r = 0.01; P = 0.6). 
correlation 0.72 versus 0.48 in frog; 0.64 versus 
0.36 in fish). This may indicate that reused TFs 
are more commonly rearranged across tissues 
during evolution to generate novelties, whereas 
TFs whose field of expression is established just 
once might have more conserved regulatory 
functions. 


Multilineage gene expression priming 
during fate choices 

It has been shown in several specific cell types— 
including hematopoietic stem cells (24) and mouse 
embryonic stem cells (25)—that uncommitted cells 
may coexpress competing fate regulators asso- 
ciated with more than one terminal cell fate. 
Such gene expression overlap, followed by refine- 
ment as cells differentiate, is known as multi- 
lineage priming (MLP) (Fig. 6A). MLP has been 
argued to be a part of the process of cell-type 
decision-making, through competition between 
fate regulators, and also to play a role in spec- 
ifying an undifferentiated state by blocking line- 
age commitment (25-28). 

The importance of MLP for cell-fate decisions 
is clearly supported by these specific examples, 
but its importance would be further strengthened 
if it were generally associated with branch points 
in differentiation. Our scRNA-seq data offered an 
opportunity to assess this over entire embryos. 
Wardle and Smith had previously (29) noted 


Fig. 5. TF reuse is pervasive in vertebrate development. 
(A) Progressive programming of cell identity through sequential 
TF activation. (B) Programming of cell identity through 
combinatorial reuse of TFs. TF A is reused (red) in combination 
with TF B to generate a new fate. (C) Half of DE TFs 


transient overlap and then refinement between 
the ectoderm-specific TF Sox2 and the mesoderm- 
specific TF Brachyury (T) during gastrula stages 
in Xenopus. This was recapitulated in our data, 
which shows coexpression of these markers in 
26% of cells expressing either gene at stage 10, 
but refinement to just 3% overlap by stage 14 
(Fig. 6, B and C). Our data also show numerous 
other examples, including similar MLP between 
early ectodermal TF Zic1, and the mesodermal 
TF Foxc1, which overlap in expression at stage 
10 (7% coexpression) before refinement by stage 
12 (<3% coexpression) (Fig. 6B). Across all fate 
choices in the embryo, we identify 412 MLP genes 
in Xenopus (see Data S6). We find that MLP is 
initially widespread, encompassing a consider- 
able fraction of highly cell-state specific genes at 
each branch point (>4-fold differentially expres- 
sed), but reduces over time (Fig. 6, D and E). For 
fate choices that occur before or up until the end 
of gastrulation, over 70% of highly cell-state spe- 
cific genes overlapped in their progenitor state in 
fish and ~50% in frog. By contrast, for subsequent 
fate choices, MLP ultimately drops to <10% in both 
species (Fig. 6D). All differentiation branch points 
showed multilineage overlap before gastrulation, 
whereas only selected branch points showed fre- 
quent MLP after gastrulation (Fig. 6E). 
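
The coexpression percentages quoted above (for example, 26% of cells expressing either Sox2 or T at stage 10) correspond to the fraction of cells expressing both genes among cells expressing at least one. A minimal sketch follows, assuming that "expressing" means a count above a chosen threshold; the threshold, function name, and toy counts are illustrative assumptions.

```python
import numpy as np

def coexpression_fraction(counts_a, counts_b, threshold=0):
    """Fraction of cells expressing both genes among cells expressing either.

    counts_a, counts_b: per-cell counts for two genes, in the same cell order.
    threshold: a cell "expresses" a gene if its count exceeds this value
    (an assumption; the source does not state the cutoff here).
    """
    a = np.asarray(counts_a) > threshold
    b = np.asarray(counts_b) > threshold
    either = a | b
    if not either.any():
        return float("nan")
    return float((a & b).sum() / either.sum())

# Hypothetical toy counts for six cells.
gene_a = np.array([3, 0, 2, 0, 1, 0])
gene_b = np.array([1, 4, 0, 0, 2, 0])

# 2 of the 4 cells expressing either gene express both, so this is 0.5.
overlap = coexpression_fraction(gene_a, gene_b)
```

Tracking this fraction for a gene pair across successive stages would show the kind of refinement reported for the progenitor states.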

Where MLP did occur, we wondered if it re- 
flected generally noisy gene expression or, more 
specifically, coexpression of regulators of cell 
fate. Suggestive of the latter possibility, MLP genes 
were significantly enriched for TF-associated 
GO terms, including “transcription factor activity,” 
“regulation of transcription,” “nucleus,” and “DNA 
binding,” as compared to all differentially ex- 
pressed (DE) genes (fig. S17) (two-fold enriched; 
P < 0.001 binomial test, Bonferroni corrected). 


Assessing the retention of pluripotency 
during neural crest development 


The utility of single-cell gene expression maps 
extends from studying general features of de- 
velopment to clarifying specific developmental 
relationships. Among embryonic lineages, the 
neural crest is unique in its broad fate potential, 
contributing to multiple tissues spanning germ 
layer boundaries. It remains an open question 
how this unique fate potential arises. As the 
neural crest forms (stages 13 and 14), it expresses 
at least eight pluripotency genes—foxd3, c-myc, 
id3, tfap2, ventx2, ets1, snail, and oct25—which 
are also expressed in the early blastula (stages 8 
and 9) (30). Functional assays showed that sev- 
eral of these genes are required for multipotency 
of both blastula and neural crest cells (30). This 
raised the intriguing possibility that the neural 
crest may retain multipotency from the blastula 
stage (a “retention model”), in contrast to the 
classical view that a field of ectodermal tissue 
Fig. 6. Refinement of promiscuous multilineage gene expression 
during early embryonic fate choices. (A) Illustration of multilineage 
priming (MLP). Two genes, specific to daughter states A and B, 
respectively, transiently overlap in the ancestor progenitor state as a 
fate decision is being made. (B and C) MLP during the fate choice 
between neural plate and dorsal marginal zone in Xenopus. Sox2 


reacquires multipotency during early neurulation 
(Fig. 7A). However, this model remains a hypo- 
thesis because a multipotent intermediate state 
between blastula and neural crest stages has 
not been directly identified. 

Our single-cell data offer an opportunity to 
reexamine these alternative hypotheses with a 
high-resolution view of the intermediate states 
between blastula and neurula stages in neural 
crest development. To support the “retention 
model,” we specifically searched for a subset 
of cells that maintain a blastula gene expression 
program, distinct from the remaining early ec- 
toderm. To further increase the likelihood of 
finding such cells, we supplemented our whole 
embryo time series with additional scRNA-seq 
data (9308 cells) collected from tissue that we 
dissected from the neural plate border region, 
which is fated to become neural crest, of stage 
11 embryos, immediately before the expression 
of neural crest genes (total of 15,426 stage 11 
cells). These data in frog, and matching data in 
fish, failed to reveal a distinct cluster of cells de- 
fined by expression of the proposed pluripotency 
genes, nor a cluster defined by any other set of 
genes enriched in the blastula. Instead, the data 
supported a more conventional differentiation 
pathway through clearly identifiable neuroecto- 
dermal intermediates (Figs. 7B and 4A). The 
same results held when examining differentia- 
tion progression not at the cell cluster level but 
at single-cell resolution (Fig. 7C). Far from show- 
ing a gene expression program that persists from 
the blastula stage, we found that the inferred 
precursors of the neural crest showed large 
changes in gene expression with hundreds of 
dynamically varying genes (Fig. 7D). The pro- 
posed suite of eight pluripotency genes, in par- 
ticular, were not limited to a single cluster of 
cells but were broadly expressed across most 
of the ectoderm, as well as in nonpluripotent 
states such as the endoderm and mesoderm at 
stage 11 (Fig. 7E) (see fig. S18 for individual 
genes). This broad expression pattern would 
have been more difficult to detect without single- 
cell data. The original experiments (30) could 
not weigh equally the expression across all tis- 
sues because the whole-mount in situ staining 
used emphasizes superficial tissues. 

We conclude that, at least at the level of 
transcription, there is no evidence of a distinct 
expression program that persists from the blas- 
tula to give rise to the neural crest. We fail to see 
such cells both using unsupervised approaches 
(clustering) and by examining the set of eight 
shared genes previously proposed to maintain 
pluripotency. It is not possible to completely 
rule out the retention of a blastula-like pluri- 
potency program in neural crest precursors from 
our data: If functional pluripotency was main- 
tained by chromatin state or posttranslational 
modification, it may be undetectable by scRNA- 
and T, as well as Zic1 and Foxc1, overlap in progenitor cells before 
becoming specific. (D and E) Global patterns of MLP in early 
development. (D) Multilineage primed genes are initially pervasive 
among DE genes at fate branch points but become progressively rarer. 
(E) MLP frequency shown for each cell-fate choice on the cell-state 
trees indicates sporadic MLP at later time points. 


seq. Nevertheless, these new studies argue against 
a persistent transcriptional program preserved 
from an earlier stage uniquely in the neural 
crest. Rather, we argue in favor of a more con- 
ventional view of neural crest development pro- 
ceeding from a well-defined ectodermal lineage. 


Discussion 


Embryonic development involves a carefully 
timed set of changes in cell behavior that drives 
the egg to a complex spatial and compositional 
pattern of cell types. A biochemical dissection 
of the underlying processes in development is 
complicated by limited material in the embryo 
and the heterogeneity of its composition, both 
of which must be considered together. Methods 
such as in situ hybridization have bridged these 
difficulties and can now be performed quantita- 
tively and simultaneously over many genes (31, 32). 
However, registering such spatial information at 
single-cell resolution can be challenging, particu- 
larly in three-dimensional tissues. In an alternative 
approach, we and others have developed single-cell 
transcriptomic methods, which sacrifice spatial 
information and in return provide a universal 
modality of measurement that is easily adapted 
to diverse situations (1, 33). In a short time, these 
methods have been widely deployed and excelled 
in revealing cell population structures and dy- 
namics (17, 34, 35). The resulting cell atlases can 
be computationally related to spatial structure 
using extensive in situ expression databases such 
as Xenbase (36, 37). For embryos, applying scRNA- 
seq involves important considerations. The method 
should not preferentially select a single cell type. 
It should be highly efficient in yield of cells. The 
dissociation procedure adopted should be quick 
and complete. Background RNA released by lysed 
cells in the sample should be minimized. These 
problems are acute for Xenopus embryos because 
they have large yolk-filled blastomeres that are sen- 
sitive to shear forces and handling. Yet these 
problems have been overcome with a method 
of efficient capture and very little cell lysis. 

We also developed analytical methods for 
interpreting the time-course measurements of 
single-cell gene expression in the embryo. Large 
temporal differences obscured temporal map- 
pings between cell states using established tools. 
This challenge was solved in our analysis by 
using shared latent spaces to measure similarities 
between states across adjacent time points. Upon 
linking cell states over time based on their gene 
expression, we found a strong match to known 
lineages. There were some differences that could 
be explained by the heterochronic formation of 
the same cell type, such as in the differentiation 
of the somite tissue, and in some cases the tran- 
scriptional cell states did not cluster along spatial 
boundaries. Yet in other aspects the data showed 
high sensitivity and could detect the first ap- 
pearance of numerous cell states far earlier than 
previously known. As shown for the neural crest, 
the sensitivity and single-cell resolution of our 
gene expression resource can be used to test 
specific hypotheses about fundamental develop- 
mental processes. 

In this study, we looked for general features 
of development that might be difficult to ap- 
preciate from examination of individual line- 
ages. In both Xenopus and zebrafish, we observed 
a transition from pervasive multilineage gene 
expression during gastrulation to more specific 
and combinatorial deployment of differentiation 
programs later. In particular, newly induced 
transcription factors increasingly demarcated 
multiple independent cell fates in the embryo 
after gastrulation, as transcriptional states within 
each germ layer are further specialized. The drop 
in the frequency of multilineage priming, and 
the increase in combinatorial TF deployment, 
co-occur with the formation of the first discrete 
embryonic cell states. In Xenopus, this transi- 
tion appears to be particularly switchlike, occur- 
ring between stages 12 and 14, just 2.5 hours apart. 
Curiously, the timing of appearance of 
distinct cell states correlates precisely with that 
of cell-fate commitment in Xenopus (38, 39). The 
global nature of this transition is reminiscent of 
the midblastula transition, which occurs in stage 
8 Xenopus embryos, when the gradual titration 
of DNA-to-histone ratios during blastula-stage 
cleavage cycles abruptly drives transcriptional ac- 
tivation, a longer cell cycle, onset of asynchronous 
cell divisions, and the start of cell motility across 
the embryo (2, 3). We do not know whether any 
single mechanism coordinates the rapid appear- 
ance of multiple cell types in the postgastrula 
embryo, however. A candidate process could be 
the formation of heterochromatin domains, which 
occurs on the appropriate time scale (40, 41), and 
could facilitate both the refinement of lineage 
program conflicts (42) and the combinatorial 
reuse of transcription factors by restricting their 
targets in a tissue-specific manner (43, 44). 

In contrast to these seemingly conserved global 
features of transcriptional dynamics in embryos, 
we found that the tissue-specific expression of 
individual genes was relatively variable across 
species. Rather than preserving global transcrip- 
tional profiles, orthologous cell states maintained 
expression of just a subset of genes, which were 
most notably enriched for TFs, and among TFs, 
those that are used in a single tissue rather than 
those that are combinatorially reused across line- 
ages. Developmental programs thus appear to 
preserve an underlying core set of transcriptional 
programs that define cellular hierarchies while 
exploring significant variation in the way most 
genes are deployed across tissues to define cel- 
lular phenotypes. This finding supports previous 
arguments that cell identities and phenotypes 
may be decoupled across evolution (45). Indeed, 
new cell states and changes in cellular hierar- 
chies across species could be associated with the 
deployment of TFs in novel locations or combi- 
nations and concomitant acquisition of batteries 
of effector genes. We found that this expression 
plasticity is independent of variation in protein 
sequence itself, surprisingly decoupling a gene’s 
structure from its expression pattern in the 
embryo across evolution. Taken together, the 
approaches and analyses presented here estab- 
lish first steps toward a data-driven dissection 
of developmental programs and how they change 
across species. 
Fig. 7. Assessing the retention of pluripotency during neural crest development. (A) Contrasting 
models of neural crest development. (Model 1) Neural crest emerges from an intermediate 

population that retains blastula pluripotency (29). (Model 2) Neural crest emerges from ectoderm and 
reactivates pluripotency. (B) Ancestors inferred from scRNA-seq support model 2, where neural crest 
derives from neural cells at the neural plate border. (C) Single-cell visualization (using SPRING) of 
neuroectoderm, non-neural ectoderm, and neural crest also indicates that neural crest derives from the 
neural plate border. (D) Neural crest differentiation involves hundreds of >3-fold dynamic marker 
genes. (E) At stage 11, the shared pluripotency circuit proposed by Buitrago-Delgado et al. (30)—foxd3, 
c-myc (myca), id3, tfap2a, ventx2.1, ets1, snail, and pou3f5.2—is expressed broadly in nonpluripotent 
cells. Score shows normalized aggregate expression; see fig. S18 for individual genes. 
SINGLE-CELL ANALYSIS 


Single-cell mapping of gene 
expression landscapes and lineage 
in the zebrafish embryo 


Daniel E. Wagner, Caleb Weinreb, Zach M. Collins, James A. Briggs, 


Sean G. Megason,* Allon M. Klein* 


High-throughput mapping of cellular differentiation hierarchies from single-cell data promises 
to empower systematic interrogations of vertebrate development and disease. Here we applied 
single-cell RNA sequencing to >92,000 cells from zebrafish embryos during the first day of 
development. Using a graph-based approach, we mapped a cell-state landscape that describes 
axis patterning, germ layer formation, and organogenesis. We tested how clonally related 

cells traverse this landscape by developing a transposon-based barcoding approach 
(TracerSeq) for reconstructing single-cell lineage histories. Clonally related cells were often 
restricted by the state landscape, including a case in which two independent lineages converge 
on similar fates. Cell fates remained restricted to this landscape in embryos lacking the chordin 
gene. We provide web-based resources for further analysis of the single-cell data. 


A major goal of developmental biology is to 
understand the progression of embryonic 
cell lineages from pluripotency to adult- 
hood (1). Fate mapping and analysis of 
mutant phenotypes have explained much 
of what we know of development, yet we still 
lack a systematic atlas of all cell states in a de- 
veloping embryo. Owing to technical advances 
in single-cell RNA sequencing (scRNA-seq) (2-6), 
it is now possible to assemble comprehensive 
single-cell atlases describing complex and dynam- 
ic in vivo biological processes. Here we utilized 
inDrops scRNA-seq (4, 7) to collect more than 
92,000 single-cell transcriptomes from dissociated 
wild-type and mutant zebrafish embryos during 
the first 24 hours of embryonic development 
(Fig. 1 and fig. S1). For different developmental 
stages, we sampled 17 to 97% of the total cells per 
embryo, sufficient to detect cell states as rare as 
0.1 to 0.5% of all cells (fig. S1C), including germ 
cells, which were detected in all time points (Fig. 
1B and table S2). From this dataset, clustering of 
the wild-type transcriptomes revealed an expand- 
ing set of epidermal, neural, mesodermal, and 
endodermal cell states over developmental time, 
many of which could be specifically annotated on 
the basis of expression of marker genes (Fig. 1B, 
fig. S2A, and table S2) (8). We collected seven bio- 
logical replicates for the final time point [24 hours 
postfertilization (hpf)], which demonstrated con- 
sistency of both transcriptional signatures and 
cell-state proportions across independent speci- 
mens (fig. S2, B and C). 
A single-cell graph of cell-state 
progression in the developing 

zebrafish embryo 

We sought to map trajectories of cell state 
during development by linking cell states across 
time. Several computational approaches exist to 
infer orderings of asynchronous processes from 


scRNA-seq data (9-11), typically by projecting 
all cells into a single low-dimensional latent 
space. Such strategies may be ill-suited to map 
gene expression in developing embryos, which 
exhibit dramatically increasing cell-state dimen- 
sionality and continuous changes in the sets and 
numbers of cell state-defining genes (fig. S2, 
D and E). To overcome these obstacles, we de- 
veloped a graph-based strategy for locally em- 
bedding consecutive time points on the basis 
of the biological variation that they share, rather 
than using a global coordinate system for all time 
points. This approach first constructs a single- 
cell k-nearest neighbor graph for each time point 
$t_i$, with nodes representing cells and edges that 
link neighbors in a low-dimensional subspace; it 
then joins the graphs by identifying neighboring 
cells in pairs of adjacent time points, by using a 
coordinate system learned from the future ($t_{i+1}$) 
time point (see methods). The resulting graph 
spans all time points and allows application of 
formal graph-based methods for data analysis. 
When applied to our zebrafish data, the full 
graph forms a branching network (Fig. 2A). In- 
spection of numerous domain and cell type- 
specific transcriptional markers shows that major 
initial branches represent neural, epidermal, and 
mesendodermal states undergoing progressive 
and spatially restricted differentiation (Fig. 2, 
B and C, and fig. S3). We also noted distinct and 
early branching events for germline, notochord, 
enveloping layer epidermis, and the prechordal 
plate. 
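
The graph construction outlined above can be sketched roughly as follows: a k-nearest neighbor graph is built within each time point in its own PCA subspace, and adjacent time points are then joined using PCA coordinates learned from the later one. This is a simplified illustration under stated assumptions, not the authors' implementation. It uses brute-force distances, links each later cell to its nearest earlier cells, and omits the mutual-nearest-neighbor filtering mentioned in the figure legend. All function names and parameter values are hypothetical.

```python
import numpy as np

def pca_fit(X, n_components):
    """Return (mean, components) for a cells x genes matrix via SVD."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def pca_project(X, mean, components):
    """Project cells into the subspace spanned by the given components."""
    return (X - mean) @ components.T

def nearest(query, ref, k, skip_self=False):
    """Indices of the k nearest rows of ref for each row of query."""
    d = np.linalg.norm(query[:, None, :] - ref[None, :, :], axis=2)
    if skip_self:  # only valid when query and ref are the same cells
        np.fill_diagonal(d, np.inf)
    return np.argsort(d, axis=1)[:, :k]

def build_time_graph(timepoints, k=2, n_components=2):
    """timepoints: list of cells x genes arrays ordered in time.

    Returns a set of undirected edges between (time_index, cell_index)
    nodes. Assumes k is smaller than the number of cells per time point.
    """
    edges = set()
    for i, X in enumerate(timepoints):
        mean, comps = pca_fit(X, n_components)
        Z = pca_project(X, mean, comps)
        # Within-time-point neighbors.
        for c, nbrs in enumerate(nearest(Z, Z, k, skip_self=True)):
            for n in nbrs:
                edges.add(frozenset({(i, c), (i, int(n))}))
        if i > 0:
            # Coordinates learned from the later time point (X) are also
            # used to embed the earlier time point before linking.
            Z_prev = pca_project(timepoints[i - 1], mean, comps)
            for c, nbrs in enumerate(nearest(Z, Z_prev, k)):
                for n in nbrs:
                    edges.add(frozenset({(i, c), (i - 1, int(n))}))
    return edges

# Hypothetical toy input: 3 time points, 10 cells each, 5 genes.
rng = np.random.default_rng(0)
toy_timepoints = [rng.normal(size=(10, 5)) for _ in range(3)]
toy_edges = build_time_graph(toy_timepoints, k=2, n_components=2)
```

Because edges only ever connect cells within a time point or in adjacent time points, the result spans the whole time course without requiring a single global coordinate system.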

To test whether this graph recapitulates known 
lineage relationships, we used a measure of graph 
distance (diffusion pseudotime, or DPT) (12) to 
Fig. 1. A single-cell transcriptional atlas of the zebrafish embryo. (A) Experimental workflow. 
Single-cell suspensions were dissociated from staged zebrafish embryos and introduced into the 
inDrops microfluidic device. Single-cell transcriptome libraries were prepared and sequenced by 
RNA-seq. (B) tSNE maps for each time point, constructed in dimensionality-reduced principal 
component analysis subspace defined by highly covariable genes (see methods). Cells are colored by 
germ layer identities inferred from expressed marker genes (see also fig. S2A and table S2). 
explore long-range temporal connections between 
cell states. Cell states of the early gastrula (shield 
stage, 6 hpf) are defined largely by positional 
marker genes (Fig. 3A), yet these cells are con- 
nected, through the single-cell graph, to tissue- 
specific states that emerge later (for example, 
pharyngula stage, 24 hpf). We found that the 
shield-stage cells with the shortest mean graph 
distance to each particular 24-hpf tissue were 
clustered and expressed spatial marker genes 
predicted from previous in vivo fate-mapping 
studies (13-16); for example, 24-hpf neural tissues 
mapped to the 6-hpf dorsal anterior epiblast (Fig. 
3B and fig. S4). Conversely, direct comparison 


of 6- and 24-hpf gene expression states failed to 
capture lineage relationships (Fig. 3B and fig. S4, 
blue points). 
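
To illustrate the idea of ranking early cells by their mean graph distance to a later tissue, the sketch below uses plain hop counts from a breadth-first search. This is a deliberately simpler stand-in for diffusion pseudotime, not the DPT measure itself, and the toy graph and node names are hypothetical.

```python
from collections import deque

def bfs_distances(adj, start):
    """Hop counts from start to every reachable node of an undirected graph."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nbr in adj.get(node, ()):
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist

def mean_distance_to_targets(adj, sources, targets):
    """Mean hop count from each source to all targets.

    A source that cannot reach some target gets an infinite mean.
    Assumes targets is non-empty.
    """
    totals = {s: 0.0 for s in sources}
    for t in targets:
        dist = bfs_distances(adj, t)
        for s in sources:
            totals[s] += dist.get(s, float("inf"))
    return {s: total / len(targets) for s, total in totals.items()}

# Hypothetical toy graph: two early cells leading toward different tissues.
adj = {
    "early_dorsal": ["mid_1", "early_ventral"],
    "early_ventral": ["mid_2", "early_dorsal"],
    "mid_1": ["early_dorsal", "neural_a", "neural_b"],
    "mid_2": ["early_ventral", "blood_a"],
    "neural_a": ["mid_1"],
    "neural_b": ["mid_1"],
    "blood_a": ["mid_2"],
}
scores = mean_distance_to_targets(
    adj, ["early_dorsal", "early_ventral"], ["neural_a", "neural_b"]
)
closest = min(scores, key=scores.get)  # "early_dorsal" for this toy graph
```

The early cells with the smallest scores are the candidates one would then compare with positions expected from fate-mapping studies.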

We next tested the extent to which the single- 
cell graph represents a simple treelike hierarchy 
of discrete states. For this, we “coarse-grained” 
the graph by collapsing groups of similar cells 
into state nodes; edges between state nodes were 
weighted by the number of original single-cell 
connecting edges. A spanning tree was then traced 
through the most densely weighted edges to a 
4-hpf root state (Fig. 3C and fig. S5A). This span- 
ning tree (the “state tree”) reflects many specific 
aspects of early development. In the neural plate, 
we observe notable branch points for the optic 
cup, diencephalon, telencephalon, mesencephalon, 
and rhombencephalon, with associated states for 
region-specific postmitotic neurons (for example, 
eomes+ and dlx1+ neurons in distinct forebrain 
branches). The neural plate also includes neural 
crest, which branches to include cell states for 
melanoblasts, iridoblasts, and xanthoblasts. In the 
lateral plate and ventral mesoderm, the state tree 
encodes extensive branching into hematopoietic 
cells, endothelial cells, heart, pharyngeal arches, 
the pronephric duct, and fin buds. In the endo- 
derm, two branch points give rise to cell states for 
pancreatic primordium (which includes insulin+ 
cells) and the pharyngeal pouch. 

Fig. 2. Single-cell graph reveals a continuous developmental landscape 
of cell states. (A) Overview of graph construction strategy and a 
force-directed layout of the resulting single-cell graph (nodes colored by 
collection time point). For each cell, up to 20 within- or between-time 
point mutual nearest neighbor edges are retained. sc-kNN, single-cell 
k-nearest neighbor; EVL, enveloping layer epidermis. (B) Single-cell 
graph colored by germ layer identities inferred from differentially 
expressed marker genes (see table S2). (C) Single-cell graphs, colored by 
log10 expression counts for indicated cell type-specific marker genes. 
UMI, unique molecular identifier. 

In the epidermal lineage, branch points differentiate the 
otic placode, lateral line, ionocytes, and several 
states expressing markers for annotated mucous- 
secreting cells (8). To facilitate data exploration, 
we developed web-based interfaces for the state 
tree and the full single-cell graph 
(www.tinyurl.com/scZfish2018). These tools permit interactive 
examination of the inferred state hierarchy, ex- 
pression for any gene of interest, and differential 
expression analysis between states, state combi- 
nations, or single cells. 
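
The coarse-graining step described above can be sketched as follows. The first function follows the text directly (state-to-state edges weighted by the number of connecting single-cell edges). The second is an assumption: the text does not name the tree-tracing algorithm, so a maximum-weight spanning tree grown from the root with Prim's algorithm is used as a stand-in. All cell and state names are hypothetical.

```python
import heapq
from collections import Counter


def coarse_grain(cell_edges, cell_state):
    """Collapse cell-cell edges into state-state edges weighted by edge count."""
    weights = Counter()
    for a, b in cell_edges:
        state_a, state_b = cell_state[a], cell_state[b]
        if state_a != state_b:
            weights[frozenset((state_a, state_b))] += 1
    return weights


def max_spanning_tree(weights, root):
    """Prim's algorithm from `root`, always adding the heaviest frontier edge."""
    neighbors = {}
    for pair, weight in weights.items():
        u, v = tuple(pair)
        neighbors.setdefault(u, []).append((weight, v))
        neighbors.setdefault(v, []).append((weight, u))
    parent = {root: None}
    # Negated weights turn Python's min-heap into a max-heap.
    frontier = [(-w, root, v) for w, v in neighbors.get(root, [])]
    heapq.heapify(frontier)
    while frontier:
        _, u, v = heapq.heappop(frontier)
        if v in parent:
            continue
        parent[v] = u
        for w, nxt in neighbors[v]:
            if nxt not in parent:
                heapq.heappush(frontier, (-w, v, nxt))
    return parent


# Hypothetical toy data: six cells assigned to three states.
cell_state = {
    "c1": "root_4hpf", "c2": "root_4hpf",
    "c3": "neural", "c4": "neural",
    "c5": "mesoderm", "c6": "mesoderm",
}
cell_edges = [
    ("c1", "c2"), ("c1", "c3"), ("c2", "c3"), ("c2", "c4"),
    ("c1", "c5"), ("c2", "c6"), ("c4", "c5"),
]
state_edges = coarse_grain(cell_edges, cell_state)
tree = max_spanning_tree(state_edges, "root_4hpf")
print(tree)
```

Here the single neural-mesoderm edge is left out of the tree, which makes it an off-tree connection of the kind discussed in the next paragraph.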

Although many major cell-state transitions 
are captured in the state tree, more complex fea- 
tures are evident in the coarse-grained and single- 
cell graphs. Off-tree interconnections between 
states, for example, were evident for the neural 
crest and pharyngeal arches, spinal cord and 
somitic mesoderm, the neural plate, and others 
(Fig. 3C and fig. S5A). To formalize the degree 
to which the developmental landscape can be 
approximated as a hierarchy with discrete, non- 
looping branches, we defined a “canalization 
score” (Fig. 3D, see legend for definition), which 
reflects the off-tree connectivity of each coarse- 
grained state node. This analysis revealed wide- 
spread regions of low canalization, particularly 
in the neural plate and somitic mesoderm. These 
observations suggest that, in contrast to the classic 
notion of a cell lineage, the zebrafish cell-state 
landscape cannot be fully represented as a tree. 
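
A minimal sketch of the idea behind the canalization score, under loud assumptions: the Fig. 3 legend defines the score as a ratio of diffusion distances to the root through tree edges only versus through all graph edges, whereas this toy substitutes unweighted hop counts for diffusion distances. The node names and the `tree_to_graph_ratio` helper are hypothetical, and the scaling of the published score is not stated in the text.

```python
from collections import deque


def hops_from_root(edges, root):
    """Breadth-first hop counts from `root` over an undirected edge list."""
    neighbors = {}
    for u, v in edges:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)
    dist = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for nxt in neighbors.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def tree_to_graph_ratio(tree_edges, off_tree_edges, root):
    """Ratio of root distance via tree edges only to root distance via all edges."""
    tree_dist = hops_from_root(tree_edges, root)
    graph_dist = hops_from_root(list(tree_edges) + list(off_tree_edges), root)
    return {n: tree_dist[n] / graph_dist[n] for n in tree_dist if n != root}


# Hypothetical state tree with one off-tree edge that shortcuts node "c".
tree_edges = [("root", "a"), ("a", "b"), ("b", "c"), ("root", "d")]
off_tree_edges = [("d", "c")]
ratios = tree_to_graph_ratio(tree_edges, off_tree_edges, "root")
print(ratios)
```

In this toy a ratio of 1 means that off-tree edges offer no shortcut to the root, and larger values flag nodes whose position is poorly described by the tree alone.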


Cell-lineage history does not invariantly 
reflect cell-state graph topology 


Although the single-cell and coarse-grained graphs 
represent an inferred landscape of developmental 
cell states, they do not reveal how individual cells 
traverse these states. A simple prediction would 
be that individual cell histories mirror graph to- 
pology. We tested this prediction by developing 
an inDrops-compatible strategy for recording 
in vivo lineage histories at the single-cell level: 
sequencing of transcribed clonally encoded ran- 
dom barcodes (TracerSeq). TracerSeq utilizes the 
Tol2 transposase system (17) to randomly integrate 
green fluorescent protein (GFP) reporter cassettes 
driven by the β-actin promoter (actb2) into the 
zebrafish genome. To render each integration 
event unique and detectable by RNA-seq, we 
utilized Gibson assembly (18) without subsequent 
amplification to introduce a random 20-nucleotide 
oligomer sequence barcode into the GFP 3’ untranslated 
region (Fig. 4A and fig. S6). 

[Fig. 3 panel labels recovered from the figure: time points 4, 6, 8, 10, 14, 18, and 24 hpf; sc-kNN graph distance versus gene expression correlation; Midbrain, Muscle-myl1, Heart, Epidermis, Endoderm, Spinal Cord, Somitic Mesoderm; Canalization Score.] 

Fig. 3. Single-cell and coarse-grained graphs encode progenitor-fate relationships. (A) tSNE 
map of 6-hpf epiblast and hypoblast states, colored by normalized transcript counts for select 
positional marker genes. Overlapping color gradients demonstrate continuous expression domains 
defined by position. Diagram relates positions of cells in the tSNE map to theoretical positions 
in the embryo. 2D, two dimensions. (B) In silico fate predictions for 6-hpf embryo cells. The top 
100 cells with predicted 24-hpf fate outcomes are indicated for shortest graph diffusion distances 
(red) or direct single-cell gene expression correlation distances (blue) between 6-hpf cells and 
24-hpf cluster centroids. ρ, Pearson correlation. (C) Construction and overview of the coarse-grained 
graph (see also fig. S5). Nodes indicate states (groups of transcriptionally similar cells), colored 
by time point. Weighted edges connect similar states within or between time points. Spanning tree 
edges connecting each node to the 4-hpf root state through the top weighted edges are highlighted 
in dark gray. (D) Coarse-grained graph nodes are colored by a canalization score, defined as 
the ratio of diffusion distances (DPT) between each node and the 4-hpf root node through state 
tree edges only versus through all graph edges. Highly canalized regions of the graph 
correspond to branches with the fewest off-tree edges. 

Because 
transgenic insertions can occur asynchronously 
over successive cell divisions, TracerSeq barcodes 
can facilitate the construction of lineage trees 
(Fig. 4A). TracerSeq offers an advantage over 
related Cas9-based approaches (19, 20), which 
can generate identical edits and/or large barcode 
deletions in independent lineages at nontrivial 
frequencies. By contrast, TracerSeq barcodes are 
uniformly distributed over a large sequence space 
(for example, 4^20 nucleotide combinations ≈ 10^12 
unique barcode sequences), facilitating straight- 
forward calling of genetic clones (fig. S7). The 
small (20-base pair) locus size also greatly sim- 
plifies the construction, sequencing, and analysis 
of TracerSeq inDrops libraries. 
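
The size of the barcode sequence space follows directly from the 20-nucleotide barcode length given above. The sketch below computes it and adds a birthday-problem estimate of how likely it is that any two independent integrations share a barcode. The estimate assumes independent, uniformly distributed barcodes, and the figure of 10,000 integrations is a hypothetical value chosen only for illustration.

```python
import math

BARCODE_LENGTH = 20   # nucleotides, from the text
ALPHABET_SIZE = 4     # A, C, G, T


def sequence_space(length=BARCODE_LENGTH, alphabet=ALPHABET_SIZE):
    """Number of distinct sequences of the given length."""
    return alphabet ** length


def collision_probability(n_barcodes, space):
    """Birthday-problem approximation for at least one repeated barcode."""
    return -math.expm1(-n_barcodes * (n_barcodes - 1) / (2 * space))


space = sequence_space()
p_collision = collision_probability(10_000, space)  # hypothetical integration count
print(space)
print(p_collision)
```

The exact space is 4^20 = 1,099,511,627,776, or roughly 10^12, so even thousands of integrations are unlikely to produce a chance duplicate under these assumptions.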

The use of TracerSeq to analyze potentially 
small clones of cells (each restricted to a single 
embryo) requires high-efficiency tissue dissocia- 
tion and transcriptomic barcoding methods. We 
therefore optimized a high-yield cell dissociation 
and recovery protocol for individual 24-hpf zebra- 
fish embryos (fig. S1D and methods) and lever- 
aged the high cell barcoding efficiency (>80%) of 
the inDrops platform (7). We then sequenced in- 
dividual embryos (n = 5) at 24 hpf (fig. S7) that 
were injected at the one-cell stage with the 
TracerSeq library, generating combined lineage 
and transcriptome datasets for 1269 clonal bar- 
codes distributed over 4342 single cells (fig. S8). 
Of these cells, 2361 (54%) were each marked by 
≥2 distinct barcode integrations; 624 cells (14%) 
were marked by ≥5 integrations (fig. S8). Hier- 
archical clustering of TracerSeq barcodes orga- 
nized these cells into more than a hundred distinct 
founder clones with internal nested clone structures 
(Fig. 4B and fig. S9, A to D). We then compared 
the lineage history and inferred transcriptional 
history of each founder clone by embedding its 
constituent cells onto the single-cell graph (Fig. 4C). 
We found that the largest clones often marked a 
wide diversity of cell states. In multiple cases, 
however, additional barcode integrations in the 
same founder clone marked cells that were state 
restricted. For example, one such clone (34F1) 
marked cells of the neural plate, epidermal tissues, 
and muscle but contained a subclone restricted 
to ectoderm. Similar lineage restriction events 
could be described for other founder clones (Fig. 
4C). These observations suggest that the current 
timing of TracerSeq integrations encompasses 
the transition from unrestricted pluripotency to 
the first fate-restriction events appearing in the 
zebrafish embryo. 
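
As a rough illustration of how shared barcodes define clones, the sketch below computes the Jaccard similarity between the barcode sets of two cells (the similarity measure named in the Fig. 4 legend) and groups cells into founder clones. The grouping rule is an assumption: cells that share any barcode, directly or transitively, are merged, which is a simple stand-in for the hierarchical clustering used in the study. Cell and barcode names are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two barcode sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def founder_clones(cell_barcodes):
    """Group cells that share at least one barcode, directly or transitively."""
    clones = []  # list of (cells, barcodes) pairs with mutually disjoint barcodes
    for cell, barcodes in cell_barcodes.items():
        merged_cells, merged_codes = {cell}, set(barcodes)
        remaining = []
        for cells, codes in clones:
            if codes & merged_codes:
                merged_cells |= cells
                merged_codes |= codes
            else:
                remaining.append((cells, codes))
        clones = remaining + [(merged_cells, merged_codes)]
    return [cells for cells, _ in clones]


# Hypothetical cells: c3 carries an extra barcode "C", marking a nested subclone.
cell_barcodes = {
    "c1": {"A", "B"},
    "c2": {"A"},
    "c3": {"B", "C"},
    "c4": {"D"},
}
clones = founder_clones(cell_barcodes)
print(jaccard(cell_barcodes["c1"], cell_barcodes["c2"]))
print(clones)
```

Later integrations such as barcode "C" are what give a founder clone its internal nested structure.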

To investigate lineage relationships more sys- 
tematically, we assessed the likelihood of recov- 
ering shared TracerSeq barcodes from all pairs 
of transcriptional states in the 24-hpf zebrafish 
embryo. We first calculated a lineage coupling 
score (fig. S9E and methods), defined as the num- 
ber of shared barcodes relative to randomized 
data (z-score standardized), with values ranging 
from positive (coupled fates) to negative (anti- 
coupled fates). Hierarchical clustering of the 
pairwise correlation between coupling scores re- 
vealed structured groups of cell states (Fig. 5A), 
which comprised related tissues and/or inferred 
germ layer derivatives. These included one dis- 
tinct group that contained both mesodermal and 
endodermal derivatives, four groups containing 
ectodermal derivatives, and two groups contain- 
ing mixtures of ectoderm and mesoderm. Several 
of these lineage groups are corroborated by prior 
fate-mapping studies. We discuss here three ex- 
amples. The first major lineage group (MesEndo) 
includes derivatives of both lateral plate meso- 
derm and endoderm. These tissues originate from 
the marginal blastomeres of the early zebrafish 
gastrula, which involute first during gastrulation 
to form the hypoblast and then rapidly migrate 
toward the animal pole (13, 15, 21). The observed 
lineage isolation of these tissues is thus con- 
sistent with an early spatial partitioning of this 
region, further reflected in Fig. 5A by negative 
lineage correlations to most other states. A sec- 
ond group (Fig. 5A, Ecto III) captures strong 
lineage couplings both between anterior neural 
tissues—including the optic cup, midbrain, and 
telencephalon (16)—and to anterior epidermal 
derivatives such as the olfactory placode (22). 
These tissues are coupled to a lower degree with 
another group (Ecto II), which includes cou- 
plings between the hindbrain, spinal cord, and 
neural crest (grem2*). The third example we note 
is a group coupling ectoderm and mesoderm 
(Fig. 5A, MesEcto II), including muscle (myl1+), 
myotome, spinal cord, posterior neural crest, and 
epidermal states. These correlations mirror the 
development of posterior body regions, which 
trace their origins to blastomeres proximal to 
the medial and ventral margin (13). These 
mesodermal-spinal cord couplings might also 
be explained by the presence of a later popula- 
tion of transient, multipotent neuromesodermal 
progenitor cells in the embryonic tailbud, which 
give rise to both of these populations (23-25). 
Interestingly, these lineage groups tend to be 
organized by position (for example, along the 
anterior-posterior axis) rather than strictly by 
germ layer or tissue origin (for example, neural, 
epidermal, and mesodermal). 

Fig. 4. Single-cell transcriptomic barcoding of cell lineages using TracerSeq. (A) Method overview. 
Tol2 transposase system integrates barcode-containing GFP reporter cassettes into zebrafish genome. 
Asterisks denote integration events. Colors (red, blue, black, and green) indicate unique barcode 
sequences. (B) Clustered heatmap for one of five TracerSeq embryos (see also fig. S9, A to D), 
displaying lineage and transcriptome information for each cell. Heatmap rows are single cells for 
which both transcriptome and ≥1 TracerSeq barcodes were recovered. Columns denote unique 
TracerSeq barcodes (left: black squares, ≥1 UMI) and tissue identities (right: red squares) inferred 
from cluster annotations (table S2). Heatmaps were clustered using Jaccard similarity and average 
linkage. (C) Examples of TracerSeq founder clones with positions of constituent cells (colored 
nodes) overlaid on the single-cell graph. Graph edges are shown in dark gray. Colors indicate the 
first lineage bifurcation within each founder clone. In the three cases shown, the founder clone 
included cells that differentiated into both ectodermal (E) and mesodermal (M) states, whereas one 
of the two first subclones was restricted to ectoderm. 

We next questioned how clonal relationships 
compared with cell-state relationships. A sim- 
plistic model of development is that cells prog- 
ressively diverge in state as they diverge in lineage. 
Developing embryos, however, could violate this 
prediction in at least two ways: First, clonally 
distinct embryonic fields can give rise to similar 
cell types (“convergent clones”); second, major 
transcriptional changes might drive related cells 
into qualitatively dissimilar states, possibly even 
late in development (“divergent clones”). Over- 
laying TracerSeq lineage correlation scores on 
the cell-state graph and comparing these scores 
to graph-derived state distances (Fig. 5B and fig. 
S10) revealed that some nearby states on the state 
graph were indeed clonally correlated, as expected 
by the simplistic model. However, nearby cell 
states also frequently displayed weak clonal cor- 
relations, suggesting convergent differentiation. 
These patterns were evident among state rela- 
tionships for endothelial, optic cup, and muscle 
tissues (Fig. 5B and fig. S10, A to F) and sys- 
tematically when examining all states (fig. S10G). 
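
The lineage coupling score introduced earlier (the number of shared barcodes relative to randomized data, z-score standardized) can be sketched as follows. The randomization scheme is an assumption, since the text defers it to the methods: here the state labels are shuffled across cells while each cell keeps its barcodes. The cells, states, and barcode names are hypothetical.

```python
import random
import statistics


def shared_barcodes(cells, state_a, state_b):
    """Count barcodes seen in at least one cell of each of the two states."""
    in_a = set().union(*(codes for state, codes in cells if state == state_a))
    in_b = set().union(*(codes for state, codes in cells if state == state_b))
    return len(in_a & in_b)


def z_score(observed, null_values):
    """Standardize an observed count against a list of null counts."""
    spread = statistics.pstdev(null_values)
    if spread == 0:
        return 0.0
    return (observed - statistics.mean(null_values)) / spread


def coupling_score(cells, state_a, state_b, n_shuffles=200, seed=0):
    """Z-score of the shared-barcode count against label-shuffled data."""
    rng = random.Random(seed)
    states = [state for state, _ in cells]
    barcodes = [codes for _, codes in cells]
    null = []
    for _ in range(n_shuffles):
        rng.shuffle(states)  # assumed null model: permute state labels
        null.append(shared_barcodes(list(zip(states, barcodes)), state_a, state_b))
    return z_score(shared_barcodes(cells, state_a, state_b), null)


# Hypothetical cells as (state, barcode set) pairs.
cells = [
    ("spinal_cord", {"b1", "b2"}),
    ("somite", {"b1"}),
    ("somite", {"b2", "b3"}),
    ("optic_cup", {"b4"}),
    ("optic_cup", {"b5"}),
    ("epidermis", {"b6"}),
]
score = coupling_score(cells, "spinal_cord", "somite")
print(shared_barcodes(cells, "spinal_cord", "somite"), score)
```

Positive scores indicate more shared barcodes than the shuffled data produce (coupled fates), and negative scores indicate fewer (anticoupled fates).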

We observed considerably fewer cases of di- 
vergent clonal behavior (fig. S10G). However, one 
notable example manifested as apparent looping 
of the neural crest into the pharyngeal arches, 
which originate in the graph from both neural 
plate and lateral plate mesoderm and merge at 
18 to 24 hpf (Fig. 2, A and B, and fig. S11A). Al- 
though the contribution of neural crest to various 
mesenchymal tissues is well established (26-28), 
the transcriptional information reflected by the 
graph loop alone does not reveal which annotated 
pharyngeal arch states arise from neural crest. 
TracerSeq data, however, provide a clear signature 
of distinct clonal patterns between pharyngeal 
arch states: One pharyngeal arch state (ph.arch- 
tbx1) is a member of the MesEndo lineage group 
with mesodermal clonal associations, whereas 
the second pharyngeal arch state (ph.arch-cd248b) 
is clonally related to neural crest and posterior 
neural states (Fig. 5A and fig. S11, B to F). These 
data indicate that cells in the ph.arch-cd248b 
state diverged from a neural plate lineage and 
subsequently converged with other lateral plate- 
derived states. The ability of embryonic clones 
to undergo dramatic converging and diverging 
behaviors thus underscores a continued need for 
independent measurements of both cell state 
and lineage in the mapping of cell-fate hierarchies. 

Fig. 5. TracerSeq reveals systematic relationships between cell lineage 
and cell state. (A) Heatmap of TracerSeq lineage coupling scores 
(see methods) between pairs of 24-hpf states, clustered by correlation 
distance and average linkage. Groups of states with similar lineage 
coupling signatures are annotated. (B) Quantitative relationships between 
lineage coupling correlation distances and scaled state tree diffusion 
distances for (i) endothelial, (ii) optic cup, and (iii) myl1+ muscle states 
(see also fig. S10, A to F). 


Robustness of cell-type transcriptional 
programs after a signaling perturbation 


Single-cell maps of vertebrate development can, 
in principle, facilitate unbiased, systematic analy- 
ses of mutant phenotypes and disease states. We 
used scRNA-seq to analyze the mutant pheno- 
type for chordin, a well-studied developmental 
gene that encodes a secreted bone morphoge- 
netic protein (BMP) inhibitor expressed in the 
organizer and required for patterning the early 
dorsal-ventral axis (29-33). Disruption of chordin 
leads to changes in gross embryo morphology, 
with an expansion of ventral tissues and a re- 
duction of dorsal tissues (30). The scRNA-seq 
method is well suited to address how all cell 
types in the embryo change in frequency, and 
in gene expression, while also allowing detec- 
tion of qualitatively new states, or combinations 
of states, if they occur. 

We used CRISPR-Cas9 (34) to disrupt the 
chordin locus, resulting in highly penetrant 
clutches of mutant zebrafish embryos (fig. S12). We 
performed inDrops profiling on chordin-targeted 
and control embryos (tyrosinase-targeted, see 
methods) in a narrow time series correspond- 
ing to ~14 to 16 hpf (Fig. 6A). After sequencing, 
we classified each of the chordin- and control- 
targeted cells to reference cell clusters of the 
14-hpf wild-type embryo (fig. S13 and methods) 
and tested for altered gene expression. We rea- 
soned that a qualitatively new cell state, if formed 
as a result of the aberrant patterning, would man- 
ifest as widespread changes in gene expression 
after mutation, with a magnitude comparable 
to the differences between wild-type embryonic 
states. Applying this criterion, we found no evi- 
dence of a qualitatively new cell state after chordin 
depletion. Rather, the number of genes differen- 
tially expressed within states was modest com- 
pared to the differences defining the wild-type 
states of the 14-hpf embryo (Fig. 6B and fig. S14A). 
Moreover, a t-distributed stochastic neighbor em- 
bedding (tSNE) mapping of CRISPR-targeted 
cells (fig. S13, A to C) identified only a single clus- 
ter solely occupied by chordin-mutant cells (fig. 
S13D), distinguished primarily by a heat shock- 
like transcriptional signature. This same stress 
signature was increased in multiple states in 
chordin-targeted embryos (fig. S14A). 
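
The thresholds quoted in the Fig. 6B legend (adjusted P value < 0.01, absolute log2 fold change > 1, average expression > 25 transcripts per million) amount to a three-part filter on per-gene statistics. The sketch below applies that filter to precomputed values and does not perform the Wilcoxon rank sum test itself. The gene names and numbers are hypothetical, and which cells the average expression is taken over is an assumption left open by the legend.

```python
from dataclasses import dataclass


@dataclass
class GeneStats:
    gene: str
    adjusted_p: float        # multiple-testing-adjusted P value
    log2_fold_change: float  # mutant versus control
    mean_tpm: float          # average expression, transcripts per million


def is_differentially_expressed(stats, max_p=0.01, min_abs_log2fc=1.0, min_tpm=25.0):
    """Apply the three cutoffs quoted in the Fig. 6B legend."""
    return (
        stats.adjusted_p < max_p
        and abs(stats.log2_fold_change) > min_abs_log2fc
        and stats.mean_tpm > min_tpm
    )


# Hypothetical per-gene statistics for one state (chordin versus control).
example = [
    GeneStats("gene_a", 0.001, 1.8, 120.0),  # passes all three cutoffs
    GeneStats("gene_b", 0.001, 0.4, 300.0),  # fold change too small
    GeneStats("gene_c", 0.200, -2.5, 80.0),  # not significant
    GeneStats("gene_d", 0.005, -1.2, 10.0),  # expression too low
    GeneStats("gene_e", 0.002, -1.6, 40.0),  # passes, downregulated
]
degs = [s.gene for s in example if is_differentially_expressed(s)]
print(degs)
```

Counting the genes that pass per state, and comparing those counts with the counts obtained between pairs of wild-type states, gives the kind of comparison summarized in Fig. 6B.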

We next tested whether chordin disruption led 
to changes in abundance of particular classified 
cell types. As expected, expansion of states cor- 
responding to ventral tissues (for example, so- 
mitic mesoderm, epidermis, hatching gland, 
blood, and endothelial tissues) at the expense 
of dorsal tissues (for example, the neural plate 
and notochord) was observed (fig. S14, A and 
B) (30, 35). Additional features could be appre- 
ciated by projecting the CRISPR datasets direct- 
ly onto the wild-type single-cell graph (Fig. 6, C 
and D). For example, a sharp boundary bisected 
the lateral plate mesoderm into two compart- 
ments of opposing chordin sensitivity, separat- 
ing the heart and fin bud progenitor fields. Similar 
juxtaposed domains of opposing chordin sen- 
sitivity were evident in the axial mesoderm, 
partitioning notochord from hatching gland, 
and in the tailbud separating spinal cord from 
somitic mesoderm (Fig. 6D). Notably, each of 
these pairs of phenotypic domains appeared to 
be organized downstream of an inferred branch- 
point in the cell-state landscape. These domain 
pairs, therefore, likely reflect binary fate choices 
that are tuned by BMP signaling in wild-type 
embryos. 
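
The abundance comparison above can be illustrated with a sketch that computes, for each classified state, the log2 ratio of its fraction among mutant cells to its fraction among control cells. The pseudocount is an assumption added to keep states with zero counts finite, no significance test is included, and the state labels and counts are hypothetical.

```python
import math
from collections import Counter


def log2_abundance_ratios(mutant_states, control_states, pseudocount=1):
    """Log2 ratio of per-state cell fractions, mutant over control."""
    mutant, control = Counter(mutant_states), Counter(control_states)
    n_mutant, n_control = len(mutant_states), len(control_states)
    ratios = {}
    for state in set(mutant) | set(control):
        f_mutant = (mutant[state] + pseudocount) / (n_mutant + pseudocount)
        f_control = (control[state] + pseudocount) / (n_control + pseudocount)
        ratios[state] = math.log2(f_mutant / f_control)
    return ratios


# Hypothetical classified cells from one mutant and one control sample.
mutant_cells = ["somite"] * 7 + ["spinal_cord"] * 1
control_cells = ["somite"] * 3 + ["spinal_cord"] * 5
ratios = log2_abundance_ratios(mutant_cells, control_cells)
print(ratios)
```

Positive values mark states that are overrepresented in the mutant and negative values mark states that are depleted, which is the sign convention used for the purple and green regions of Fig. 6D.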

In a final analysis, we searched for the putative 
identity of the cells responding to chordin in the 
tailbud, as this is the site showing the largest 
expansion (somitic mesoderm) and loss (spinal 
cord) after perturbation. In zebrafish, chordin is 
expressed in the embryonic shield, adaxial cells, 
posterior tailbud region, and also transiently 
in the neural plate (36). All of these expression 
patterns were confirmed in our single-cell graphs 
(fig. S15A). Furthermore, in contrast to its earlier 
expression in the shield, continued expression of 
chordin in the tailbud was distinct among a large 
panel of known BMP inhibitor genes (fig. S15A) 
and was tightly apposed by expression domains 
for multiple bmp transcripts (fig. S15B). These 
expression characteristics might explain the in- 
creased chordin sensitivity of posterior body re- 
gions. To examine this region in greater detail, 
we isolated a subgraph of tailbud and descend- 
ent cells. Consistent with previous studies, two 
cell-state trajectories branching from a common 
neuromesodermal-like brachyury+;sox2+ progeni- 
tor state were identified, each expressing markers 
of neural fates (sox3, sox19a, pax6a, and neurog1) 
or somitic fates (tbx16, tbx6, tbx24, msgn1, and 
myod1) (fig. S16, A to C) (25, 37-39). Notably, 
the neural-mesodermal branchpoint coincided 
with the boundaries of both chordin expression 
and sensitivity (fig. S16, D and E). The chordin- 
expressing cells in this region of the single-cell 
graph exhibited a distinct expression profile (fig. 
S17), including a cadherin (cdh11), early neuro- 
genic markers (her3, her8a, and sox19a), and 
several relatively uncharacterized genes (gig2g, 
foxb1b, and foxb1a). We hypothesize that these 
cells represent a key transition state, at which 
point tailbud cells initiate a posterior neuro- 
genic program in a chordin-dependent manner. 


Discussion 


Our study demonstrates a graph-based approach 
for mapping whole-embryo developmental land- 
scapes, over time, from scRNA-seq data. The 
graph was constructed with minimal assump- 
tions about development and describes indi- 
vidual cell states transitioning from pluripotent 
blastomeres to a large array of cell types and 
tissues during the first day of zebrafish embryo- 
genesis. This dataset can now be mined to iden- 
tify temporal and tissue associations for any 
gene, cell type, or biological process of interest. 
As with genome annotation efforts over the years, 
we expect that the annotation of identified cell 
states may undergo refinement with community 
input. 

As single-cell atlases and landscapes of embryo 
development become routinely available, one is 
challenged to reconsider the relationship between 
a cell lineage (by definition, a tree) and the con- 
siderably more topologically complex gene ex- 
pression landscape through which these cells 
traverse. Using TracerSeq, we confirmed that 
differentiating cells of the zebrafish embryo do 
not invariantly follow treelike hierarchies. In- 
stead, we observed both widespread convergence 
in cell states for clonally distant cells and instances 
in which clonally related cells diverged into distant 
states. Non-treelike convergence of cell states 
could be explained by the differentiation of well- 
separated spatial domains of the embryo into the 
same basic cell types (for example, along the 
anterior-posterior axis), whereas divergence could 
involve mechanisms such as asymmetric cell di- 
vision or exposure to spatially varying signals 
(40). We anticipate that the synthesis of single- 
cell lineage and transcriptome information will 
continue to be crucial for deciphering how cells 
traverse state trajectories with complex topologies 
(for example, loops or continua). 

Single-cell mapping of genetic perturbation 
data presents a powerful framework for iden- 
tifying regulatory features of a developmental 
landscape. After disruption of chordin, which 
encodes a BMP inhibitor, we showed that the 
defining transcriptional features of the land- 
scape remained mostly unchanged, yet cell-state 
abundances could be dramatically and recip- 
rocally altered, as if the landscape were “tilted” 
but cell fates remain canalized. Future systematic 
mapping of signaling perturbations could be 
used to reveal the complete signaling logic of 
the embryo, as cells are specified toward their 
final fates. Together, these studies demonstrate 
the power, modularity, and quantitative benefits 
of unbiased scRNA-seq-based interrogations of 
embryonic development. We anticipate that sim- 
ilar large-scale datasets will facilitate explora- 
tions of additional developmental stages, tissues, 
and species. 

Fig. 6. Regulatory features of the developmental landscape identified by genetic perturbation. 
(A) Overview of the CRISPR experiment. Three pairs of chordin- and tyrosinase (control)-targeted 
samples were prepared and processed by inDrops at ~14 to 16 hpf. (B) Histogram depicting numbers 
of differentially expressed genes (DEGs) identified in chordin versus control (tyrosinase) cells for each 
state (blue bars), compared to DEG numbers when comparing between all state pairs (red bars). DEGs 
were identified by Wilcoxon rank sum test (adjusted P value < 0.01, absolute log2 fold change > 1, 
average expression > 25 transcripts per million). (C) Histogram of Pearson correlation similarities 
(after PCA projection) between each chordin or tyrosinase cell and its nearest neighbor from 10-, 14-, and 
18-hpf wild-type datasets (see methods). (D) Log2 ratios of cell states with significant differential 
abundance (false discovery rate < 0.25) in the chordin versus tyrosinase samples. Purple and green 
regions correspond to wild-type cell states that are over- or underrepresented in the chordin mutant, 
respectively. Adjacent graph domains with opposing chordin sensitivity are highlighted by brackets. TB, 
tailbud region (see cdx4 expression in fig. S3). 


Reducing food’s environmental impacts through producers and consumers 


J. Poore1,2* and T. Nemecek3 


Food’s environmental impacts are created by millions of diverse producers. To identify solutions 
that are effective under this heterogeneity, we consolidated data covering five environmental 
indicators; 38,700 farms; and 1600 processors, packaging types, and retailers. Impact can vary 
50-fold among producers of the same product, creating substantial mitigation opportunities. 
However, mitigation is complicated by trade-offs, multiple ways for producers to achieve low 
impacts, and interactions throughout the supply chain. Producers have limits on how far they can 
reduce impacts. Most strikingly, impacts of the lowest-impact animal products typically exceed 
those of vegetable substitutes, providing new evidence for the importance of dietary change. 
Cumulatively, our findings support an approach where producers monitor their own impacts, 
flexibly meet environmental targets by choosing from multiple practices, and communicate their 
impacts to consumers. 


With current diets and production prac- 
tices, feeding 7.6 billion people is degrad- 
ing terrestrial and aquatic ecosystems, 
depleting water resources, and driving 
climate change (1, 2). It is particularly 
challenging to find solutions that are effective 
across the large and diverse range of producers 
that characterize the agricultural sector. More 
than 570 million farms produce in almost all the 
world’s climates and soils (3), each using vastly 
different agronomic methods; average farm sizes 
vary from 0.5 ha in Bangladesh to 3000 ha in 
Australia (3); average mineral fertilizer use ranges 
from 1 kg of nitrogen per ha in Uganda to 300 kg 
in China (4); and although four crops provide half 
of the world’s food calories (4), more than 2 million 
distinct varieties are recorded in seed vaults (5). 
Further, products range from minimally to heavily 
processed and packaged, with 17 of every 100 kg of 
food produced transported internationally, increas- 
ing to 50 kg for nuts and 56 kg for oils (4). 
Previous studies have assessed aspects of this 
heterogeneity by using geospatial data sets (6-8), 
but global assessments using the inputs, outputs, 
and practices of actual producers have been lim- 
ited by data. The recent rapid expansion of the 
life cycle assessment (LCA) literature is providing 
this information by surveying producers around 
the world. LCA then uses models to translate pro- 
ducer data into environmental impacts with suf- 
ficient accuracy for most decision-making (9-11). 
To date, efforts to consolidate these data or build 
new large-scale data sets have covered greenhouse 
gas (GHG) emissions only (8, 12, 13), agriculture 
only (13-16), small numbers of products (8, 14-16), 
and predominantly Western European producers 
(12-16) and have not corrected for important meth- 
odological differences between LCAs (12-16). Here, 
we present a globally reconciled and methodolog- 
ically harmonized database on the variation in food’s 
multiple impacts. Our results show the need for 
far-reaching changes in how food’s environmental 
impacts are managed and communicated. 


Building the multi-indicator 
global database 


We derived data from a comprehensive meta- 
analysis, identifying 1530 studies for potential 
inclusion, which were supplemented with addi- 
tional data received from 139 authors. Studies 
were assessed against 11 criteria designed to 
standardize methodology, resulting in 570 suit- 
able studies with a median reference year of 
2010 (17). The data set covers ~38,700 commer- 
cially viable farms in 119 countries (fig. S2) and 
40 products representing ~90% of global pro- 
tein and calorie consumption. It covers five im- 
portant environmental impact indicators (18): 
land use; freshwater withdrawals weighted by 
local water scarcity; and GHG, acidifying, and 
eutrophying emissions. For crops, yield repre- 
sents output for a single harvest. Land use in- 
cludes multicropping (up to four harvests per 
year), fallow phases (uncultivated periods be- 
tween crops), and economic allocation to crop 
coproducts such as straw. This makes it a stron- 
ger indicator of both farm productivity and 
food security than yield. 

The system we assess begins with inputs (the 
initial effect of producer choice) and ends at re- 
tail (the point of consumer choice) (fig. S1). For 
each study, we recorded the inventory of out- 
puts and inputs (including fertilizer quantity 
and type, irrigation use, soil, and climatic con- 
ditions). Where data were not reported, for ex- 
ample, on climate, we used study coordinates 
and spatial data sets to fill gaps. We recorded 
environmental impacts at each stage of the sup- 
ply chain. For GHG emissions, we further disag- 
gregated the farm stage into 20 emission sources. 
We then used the inventory to recalculate all 
missing emissions. For nitrate leaching and 
aquaculture, we developed new models for this 
study (17). 

Studies included provided ~1050 estimates 
of postfarm processes. To fill gaps in process- 
ing, packaging, or retail, we used additional 
meta-analyses of 153 studies providing 550 ob- 
servations. Transport and losses were included 
from global data sets. Each observation was 
weighted by the share of national production it 
represents, and each country by its share of 
global production. We then used randomiza- 
tion to capture variance at all stages of the 
supply chain (17).
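
The two-level weighting described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the study's actual procedure (which is documented in its supplementary materials): the function name `global_weighted_mean`, the tuple layout, and the example numbers are all hypothetical.

```python
from collections import defaultdict


def global_weighted_mean(observations, country_share):
    """Two-level weighted mean of an impact indicator.

    observations: iterable of (country, impact, national_share) tuples, where
        national_share is the share of national production the observation
        represents (hypothetical layout).
    country_share: dict mapping country to its share of global production.
    """
    by_country = defaultdict(list)
    for country, impact, national_share in observations:
        by_country[country].append((impact, national_share))

    weighted_sum = 0.0
    weight_total = 0.0
    for country, rows in by_country.items():
        # Level 1: weight observations within a country.
        share_total = sum(share for _, share in rows)
        national_mean = sum(impact * share for impact, share in rows) / share_total
        # Level 2: weight countries by their share of global production.
        weighted_sum += national_mean * country_share[country]
        weight_total += country_share[country]
    return weighted_sum / weight_total


# Hypothetical example: two observations in country "A", one in country "B".
example = [("A", 2.0, 0.75), ("A", 6.0, 0.25), ("B", 10.0, 1.0)]
result = global_weighted_mean(example, {"A": 0.6, "B": 0.4})
```

Normalizing by the summed weights at each level keeps the estimate well defined when the shares in the sample do not add up to exactly one.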

We validated the global representativeness of 
our sample by comparing average and 90th- 
percentile yields to Food and Agriculture Or- 
ganization (FAO) data (4), which reconcile to 
within ±10% for most crops. Using FAO food
balance sheets (4), we scaled up our sample data. 
Total arable land and freshwater withdrawals 
reconcile to FAO estimates. Emissions from de- 
forestation and agricultural methane fall within 
ranges of independent models (17). 


Environmental impacts of the entire 
food supply chain 


Today’s food supply chain creates ~13.7 billion 
metric tons of carbon dioxide equivalents (CO2eq), 
26% of anthropogenic GHG emissions. A further 
2.8 billion metric tons of CO2eq (5%) are caused
by nonfood agriculture and other drivers of de- 
forestation (17). Food production creates ~32% 
of global terrestrial acidification and ~78% of 
eutrophication. These emissions can fundamen- 
tally alter the species composition of natural 
ecosystems, reducing biodiversity and ecological 
resilience (19). The farm stage dominates, rep- 
resenting 61% of food’s GHG emissions (81% 
including deforestation), 79% of acidification, 
and 95% of eutrophication (table S17). 

Today’s agricultural system is also incredibly 
resource intensive, covering ~43% of the world’s 
ice- and desert-free land. Of this land, ~87% is 
for food and 13% is for biofuels and textile crops 
or is allocated to nonfood uses such as wool and 
leather. We estimate that two-thirds of freshwater 
withdrawals are for irrigation. However, irriga- 
tion returns less water to rivers and groundwater 
than industrial and municipal uses and pre- 
dominates in water-scarce areas and times of 
the year, driving 90 to 95% of global scarcity- 
weighted water use (17). 


Highly variable and skewed 
environmental impacts 


We now group products by their primary dietary 
role and express impacts per unit of primary 
nutritional benefit (Fig. 1 and fig. S3). Immedi- 
ately apparent in our results is the high variation 
in impact among both products and producers. 
Ninetieth-percentile GHG emissions of beef 
are 105 kg of CO2eq per 100 g of protein, and
land use (area multiplied by years occupied) is
370 m²·year. These values are 12 and 50 times
greater than 10th-percentile dairy beef impacts
(which we report separately given that its pro-
duction is tied to milk demand). Tenth-percentile
GHG emissions and land use of dairy beef are
then 36 and 6 times greater than those of peas.
High variation within and between protein-rich
products is also manifest in acidification, eutro-
phication, and water use.

Fig. 1. Estimated global variation in GHG emissions, land use, terrestrial acidification,
eutrophication, and scarcity-weighted freshwater withdrawals, within and between
40 major foods. (A) Protein-rich products. Grains are also shown here given that they
contribute 41% of global protein intake, despite lower protein content. (B) Milks.
(C) Starch-rich products. (D) Oils. (E) Vegetables. (F) Fruits. (G) Sugars. (H) Alcoholic
beverages (1 unit = 10 ml of alcohol; ABV, alcohol by volume). (I) Stimulants. n = farm
or regional inventories. Pc and pctl., percentile; scty., scarcity.

Within the major crops wheat, maize, and rice, 
90th-percentile impacts are more than three times 
greater than 10th-percentile impacts on all five 
indicators. Within major growing areas for these 
crops (the Australian wheat belt, the U.S. corn 
belt, and the Yangtze river basin), land use be- 
comes less variable, but we observe the same 
high levels of variation in all other indicators. 
This variability, even among producers in similar 
geographic regions, implies substantial potential 
to reduce environmental impacts and enhance 
productivity in the food system. 
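
A ratio between 90th- and 10th-percentile impacts, as used in the comparison above, could be computed along the following lines. This is a sketch only: it uses unweighted percentiles, which is a simplifying assumption (the study weights observations by production), and the function name `p90_to_p10_ratio` is hypothetical.

```python
from statistics import quantiles


def p90_to_p10_ratio(impacts):
    """Ratio of the 90th- to the 10th-percentile value (unweighted)."""
    # quantiles(..., n=10) returns the nine decile cut points;
    # index 0 is the 10th percentile and index 8 is the 90th.
    deciles = quantiles(impacts, n=10, method="inclusive")
    return deciles[8] / deciles[0]


# Hypothetical impacts for eleven farms producing the same crop.
ratio = p90_to_p10_ratio([1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11])
```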

For many products, impacts are skewed by 
producers with particularly high impacts. This 
creates opportunities for targeted mitigation, 
making an immense problem more manageable. 
For example, for beef originating from beef herds, 
the highest-impact 25% of producers represent 
56% of the beef herd’s GHG emissions and 61% of 
the land use (an estimated 1.3 billion metric tons 
of CO2eq and 950 million ha of land, primarily
pasture). Across all products, 25% of producers
contribute on average 53% of each product’s envi-
ronmental impact (fig. S3). For scarcity-weighted
freshwater withdrawals, the skew is particular- 
ly pronounced: Producing just 5% of the world’s 
food calories creates ~40% of the environmental 
burden. We will now explore how to access these 
mitigation opportunities through heterogenous 
producers. 
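
The skew statistic above (the share of a product's impact attributable to its highest-impact producers) can be illustrated with a short sketch. It assumes that producers are ranked by impact per unit of output and that the 25% is measured as a share of output; both are assumptions made for illustration, and `top_share` is a hypothetical name.

```python
def top_share(intensities, outputs, fraction=0.25):
    """Share of total impact from the highest-intensity producers that
    together supply `fraction` of total output.

    intensities: impact per unit of output for each producer.
    outputs: output of each producer, in the same units.
    """
    pairs = sorted(zip(intensities, outputs), reverse=True)
    total_impact = sum(intensity * output for intensity, output in pairs)
    remaining = fraction * sum(outputs)

    impact = 0.0
    for intensity, output in pairs:
        # Take only as much of this producer's output as is still needed.
        take = min(output, remaining)
        impact += intensity * take
        remaining -= take
        if remaining <= 0:
            break
    return impact / total_impact


# Hypothetical example: four equally sized producers, one with high impact.
share = top_share([10, 2, 1, 1], [1, 1, 1, 1], fraction=0.25)
```

In this toy case a single producer supplies a quarter of output but about 71% of the impact, the same kind of concentration the text reports for beef herds.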


Mitigation through producers 
Enable producers to monitor 
multiple impacts 


The first step in mitigation is estimating pro- 
ducer impacts. Prior research [e.g., (7, 8, 14)] has 
suggested that readily measurable proxies pre- 
dict farm-stage impacts, avoiding the need for 
detailed assessment. From our larger data set, 
which includes more practices and geographies 
than prior studies, we assess the predictive power 
of common proxies, including crop yield, nitro- 
gen use efficiency, milk yield per cow, liveweight 
gain, pasture area, and feed conversion ratios. 
Although most proxies significantly covary with 
impact, they make poor predictors when used 
alone, explaining little of the variation among 
farms (coefficient of determination R² = 0 to 27%
in 47 of 48 proxy-impact combinations assessed) 
(fig. S4). 
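
For a single proxy, the coefficient of determination of a simple linear fit equals the squared correlation between proxy and impact. The sketch below shows that calculation in plain Python; the function name and the example data are hypothetical and are not taken from the study's data set.

```python
def r_squared(proxy, impact):
    """R² of a one-variable linear regression of impact on proxy."""
    n = len(proxy)
    mean_x = sum(proxy) / n
    mean_y = sum(impact) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(proxy, impact))
    s_xx = sum((x - mean_x) ** 2 for x in proxy)
    s_yy = sum((y - mean_y) ** 2 for y in impact)
    # Squared Pearson correlation: share of variance in impact explained.
    return s_xy * s_xy / (s_xx * s_yy)


# Hypothetical data: a perfectly predictive proxy and an uninformative one.
perfect = r_squared([1, 2, 3, 4], [2, 4, 6, 8])
uninformative = r_squared([1, 2, 3, 4], [1, -1, -1, 1])
```

A proxy can covary significantly with impact across many farms and still leave most of the variance unexplained, which is the distinction the paragraph above draws.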

Prior research has also suggested using one 
impact indicator to predict others (20). We find 
weakly positive and sometimes negative relation- 
ships between indicators. For similar products 
globally, correlations between indicators are low 
(R² = 0 to 30% in 26 of 32 impact-impact com-
binations assessed) (fig. S4). Pork, poultry meat,
and milk show higher correlations between acid-
ification and eutrophication (R² < 54%), explained
by the dominant role of manure in these impacts, 
but this does not generalize to other products or 
indicators. The same conclusion holds for farms 
in similar geographies or systems (fig. S5).

Monitoring multiple impacts and avoiding 
proxies supports far better decisions and helps 
prevent harmful, unintended consequences. How- 
ever, two recent studies suggest that data on 
practices and geography, required to quantify 
impacts, must come directly from producers 
(11, 21): quantifying impacts with the use
of satellite or census data misses much of the
variation among farms.


Set and incentivize mitigation targets 


When land use or emissions are low, we find 
trade-offs between indicators for many crops 
(fig. S5). This reflects diminishing marginal yield 
with increasing inputs as crops tend toward their 
maximum yields (22). For example, for already 
low-emission Northern European barley farms, 
halving land use can increase GHG emissions per 
kilogram of grain by 2.5 times and acidification 
by 3.7 times. To explore trade-offs further, we pair 
observations from the same study, location, and 
year that assess a practice change (fig. S6). Of the 
nine changes assessed, only two (changing from 
monoculture to diversified cropping and improv- 
ing degraded pasture) deliver statistically significant 
reductions in both land use and GHG emissions. 

Geography influences these trade-offs. For ex- 
ample, in the Australian wheat belt, where farmers 
practice low-rainfall, low-input farming, we find 
that both output per hectare and GHG emissions 
are in the bottom 15% globally. The environmental 
and social importance of different impacts also 
varies locally, given land scarcity, endemic bio- 
diversity, and water quality, among other factors. 
Setting regional and sector-specific targets will 
help producers navigate trade-offs and make 
choices that align with local and global priorities. 


Meet targets by choosing from multiple 
practice changes 


To meet these targets, policy might encourage wide- 
spread adoption of certain practices. However, the 
environmental outcomes of many practices, such as 
conservation agriculture (23), organic farming (fig. 
S6), and even integrated systems of best practice 
(24), are highly variable. Using our data set, we can 
generalize these findings. To do this, we disaggre- 
gate each environmental indicator into its sources 
or drivers. We consider practice change as a pack- 
age of measures that targets one or more of these 
sources. If producers have different impact sources, 
the effects of practice change will be variable. 

We find that sources of impact vary consider- 
ably among farms producing the same product 
(Fig. 2 and figs. S7 to S9). Priority areas for 
reducing impact for one farm may be immaterial 
for another. For example, measures to reduce di- 
rect nitrous oxide emissions from synthetic and 
organic fertilizer, such as biochar application, are 
included in many mitigation estimates (25). How-
ever, for a third of global crop calorie production,

Poore et al., Science 360, 987-992 (2018) 1 June 2018 


[Figure 2. Panels: (A) Wheat farms; (B) Beef farms; (C) Below median GHG emissions wheat farms; (D) Below median GHG emissions beef farms. Axis: contribution of each source to farm-stage GHG emissions, 0% to 100%.]

Fig. 2. Contributions of emission sources to total farm-stage GHG emissions. (A and B) Gray 
bars show 10th- and 90th-percentile contributions. Shaded bars represent the distribution. For 
example, the 90th-percentile contribution of organic fertilizer N2O to farm-stage emissions is
16%, but for most wheat producers the contribution is near 0%. Density is estimated using a
Gaussian kernel with bandwidth selection performed with biased cross-validation. (C and D)
Contributions of emission sources for example producers with below-median GHG emissions. 


these emissions represent less than 5% of farm- 
stage GHGs. It may be the case that low-impact 
farms have similar impact drivers. We again find 
variable sources of impact, even for low-impact 
farms (Fig. 2, C and D). Reducing impacts means 
focusing on different areas for different producers 
and, by implication, adopting different practices. 
To explore this further, we use sensitivity analysis 
(26) to decompose the variance in each product’s 
impact into its sources. Numerous sources con- 
tribute to variance (fig. S10). Most notably, for all 
crop calorie production globally, differences in fal- 
low duration and multiple cropping drive 40% of 
the variance in land use. This is important as most 
strategies to increase productivity are focused on 
increasing single crop yields (27). But for many 
producers, increasing cropping intensity through 
the use of early-maturing varieties, intercropping, 
catch crops, and enhanced irrigation can provide 
more economically viable and trade-off-free ways 
to boost productivity and reduce impacts (27). 
Geography plays a major role in this variation 
and affects the economic and environmental 
desirability of different practices (28). However, 
at the heart of agriculture is changing site con- 
ditions to enhance productivity (such as liming, 
terracing, or installing drainage), meaning that 
statements on the importance of geography have 
limitations. Nevertheless, some impact sources 
stand out. We find that freshwater aquaculture 
ponds create 0 to 450 g of methane per kg of 
liveweight (for context, enteric fermentation
in dairy cows creates ~30 to 400 g per kg of live-
weight). Of this variation, a third is explained by
temperature (17), which accelerates methano- 
genesis and net primary production. Improving 
aeration and limiting addition of surplus feed 
to ponds can abate these emissions, particularly 
important in warm countries. Further, for every 
kilogram of nitrogen applied to crops, between 
60 and 400 g is lost in reactive forms. Of this 
wide range, ~40% is explained by site conditions, 
including soil pH, temperature, and drainage 
(17). Prior research has also found that the po- 
tential of soil to store carbon varies significantly 
with soil properties, slope, and prior practice (29). 

Providing producers with multiple ways to 
reduce their environmental impacts recognizes 
the variability in sources and drivers of impact 
but requires a step change in thinking: that prac- 
tices such as conservation agriculture or organic 
farming are not environmental solutions in them- 
selves but options that producers choose from 
to achieve environmental targets. 

However, some practice changes can be pur- 
sued across all producers. Methane from flooded 
rice, enteric methane from ruminants, and con- 
centrate feed for pigs and poultry are sizeable 
globally, representing 30% of food’s GHG emis- 
sions; are material for all producers, contributing 
at least 17% of farm-stage emissions (Fig. 2B and 
fig. S7); and can be mitigated with relatively trade- 
off-free approaches such as shorter and shallower 
rice flooding (30), improving degraded pasture
(fig. S6), and improving lifetime animal produc-
tivity (8). Further, emissions from deforestation
and cultivated organic soils drive on average
42% of the variance in each product’s agricul-
tural GHG emissions (fig. S10) and dominate
the highest-impact producers’ emissions (fig. S11),
further justifying ongoing efforts to curb forest
loss and limit cultivation on peatlands.

Fig. 3. Mean and 10th-percentile GHG emissions of protein-rich products across three major
production stages. (A to C) Red lines represent average vegetable protein emissions, and blue
lines represent 10th-percentile emissions. The gray line represents 10th-percentile emissions
excluding nuts, which can temporarily sequester carbon if grown on cropland or pasture. To
calculate 10th-percentile emissions by stage, we averaged across farms that have total emissions
between the 5th and 15th percentiles, controlling for burden shifting between stages.


Communicate impacts up the 
supply chain 
Processors, distributors, and retailers can substan- 
tially reduce their own impacts. For any product, 
90th-percentile postfarm emissions are 2 to 140 
times larger than 10th-percentile emissions, indi- 
cating large mitigation potential (fig. S12). For 
example, returnable stainless steel kegs create just 
20 g of CO2eq per liter of beer, but recycled glass
bottles create 300 to 750 g of CO2eq, and bottles
sent to landfills create 450 to 2500 g of CO2eq.
Processing, more durable packaging, and 
greater usage of coproducts can also reduce food 
waste. For example, wastage of processed fruit 
and vegetables is ~14% lower than that of fresh 
fruit and vegetables, and wastage of processed 
fish and seafood is ~8% lower (24). Providing 
processors and retailers with information about
the impacts of their providers could encourage
them to reduce waste where it matters most.
For products such as beef, distribution and retail 
losses contribute 12 to 15% of emissions (fig. S13), 
whereas the sum of emissions from packaging, 
transport, and retail contributes just 1 to 9%. 
Here, reducing losses is a clear priority. 

As a third strategy, procurement could source 
from low-impact farms. Although this strategy is 
important, and possible only with information 
about the impacts of providers, it has clear limita- 
tions. To be effective, it relies on high-impact 
production not simply being purchased elsewhere 
in the market. The case of the Roundtable on 
Sustainable Palm Oil (RSPO) shows that this is 
hard to achieve: despite one-fifth of 2017 palm oil 
production being certified, there remains virtu- 
ally no demand in China, India, and Indonesia 
(31). Alternatively, this strategy would be ef- 
fective if higher prices for sustainable produc- 
tion incentivized low-impact producers to increase 
output or high-impact producers to change prac- 
tices. The case of organic food shows how passing 
premiums to consumers limits total market size 
and widespread practice change. 

However, processors and retailers routinely 
demand that products meet taste, quality, and 
food safety standards. These markets are con-
centrated, with just 10 retailers representing
52% of U.S. grocery sales and 15% of global sales
(32). This sometimes means that standards 
achieve market transformation (33), where vir- 
tually all producers adhere to gain market access. 
A fourth strategy for producers is setting en- 
vironmental standards. These are particularly 
important: Although many environmental issues 
can be monitored and mitigated in a flexible 
way, issues such as harmful pesticide usage and 
deforestation require strict controls, and issues 
such as on-farm biodiversity are hard to quantify 
(28). Procurement, farming organizations, and in- 
ternational policy-makers must come together to 
implement a safety net for global agriculture— 
comprehensive standards to manage the worst 
and hardest-to-quantify environmental issues, 
extending the successes of existing schemes and 
enabling a flexible mitigation approach to op- 
erate effectively. 


Producer mitigation limits and the role 
of consumers 


Though producers are a vital part of the solu- 
tion, their ability to reduce environmental impacts 
is limited. These limits can mean that a product 
has higher impacts than another nutritionally 
equivalent product, however it is produced. 
In particular, the impacts of animal products 
can markedly exceed those of vegetable substitutes 
(Fig. 1), to such a degree that meat, aquaculture, 
eggs, and dairy use ~83% of the world’s farmland 
and contribute 56 to 58% of food’s different emis- 
sions, despite providing only 37% of our protein and 
18% of our calories. Can animal products be pro- 
duced with sufficiently low impacts to redress this 
vast imbalance? Or will reducing animal product 
consumption deliver greater environmental benefits? 
We find that the impacts of the lowest-impact 
animal products exceed average impacts of sub- 
stitute vegetable proteins across GHG emissions, 
eutrophication, acidification (excluding nuts), and 
frequently land use (Fig. 1 and data S2). These 
stark differences are not apparent in any product 
groups except protein-rich products and milk. 
Although tree crops can temporarily sequester 
carbon and reduce nutrient leaching, the impact 
of nuts is dominated by low-yielding cashews 
and water-, fertilizer-, and pesticide-intensive 
almonds. Production of nuts doubled between 
2000 and 2015 (4), and more work is required to 
improve their resource use efficiency. Although 
aquaculture can have low land requirements, in 
part by converting by-products into edible pro- 
tein, the lowest-impact aquaculture systems still 
exceed emissions of vegetable proteins. This chal- 
lenges recommendations to expand aquaculture 
(1) without major innovation in production prac-
tices first. Further, though ruminants convert
~2.7 billion metric tons of grass dry matter, of 
which 65% grows on land unsuitable for crops 
(34), into human-edible protein each year, the 
environmental impacts of this conversion are 
immense under any production method prac- 
ticed today. 

Fig. 4. Graphical representation of the mitigation framework.

Using GHG emissions (Fig. 3), we identified five
primarily biophysical reasons for these results.
These reasons suggest that the differences between
animal and vegetable proteins will hold into the
future unless major technological changes dispro- 
portionately target animal products. First, emissions 
from feed production typically exceed emissions of 
vegetable protein farming. This is because feed- 
to-edible protein conversion ratios are greater than 
2 for most animals (13, 34); because high usage of 
low-impact by-products is typically offset by low 
digestibility and growth; and because additional 
transport is required to take feed to livestock. Sec- 
ond, we find that deforestation for agriculture is 
dominated (67%) by feed, particularly soy, maize, 
and pasture, resulting in losses of above- and below- 
ground carbon. Improved pasture management 
can temporarily sequester carbon (25), but it re- 
duces life-cycle ruminant emissions by a maximum 
of 22%, with greater sequestration requiring more 
land. Third, animals create additional emissions 
from enteric fermentation, manure, and aquaculture 
ponds. For these emissions alone, 10th-percentile 
values are 0.4 to 15 kg of CO2eq per 100 g of pro-
tein. Fourth, emissions from processing, particu-
larly emissions from slaughterhouse effluent, add
a further 0.3 to 11 kg of CO2eq, which is greater
than processing emissions for most other products. 
Last, wastage is high for fresh animal products, 
which are prone to spoilage. 


Mitigation through consumers 


Today, and probably into the future, dietary 
change can deliver environmental benefits on 
a scale not achievable by producers. Moving from
current diets to a diet that excludes animal pro- 
ducts (table S13) (35) has transformative potential, 
reducing food’s land use by 3.1 (2.8 to 3.3) billion ha 
(a 76% reduction), including a 19% reduction 
in arable land; food’s GHG emissions by 6.6 (5.5 to 
7.4) billion metric tons of CO2eq (a 49% reduction);
acidification by 50% (45 to 54%); eutrophication by 
49% (37 to 56%); and scarcity-weighted freshwater 
withdrawals by 19% (−5 to 32%) for a 2010 refer-
ence year. The ranges are based on producing new
vegetable proteins with impacts between the 10th- 
and 90th-percentile impacts of existing produc- 
tion. For the United States, where per capita meat 
consumption is three times the global average,
dietary change has the potential for a far greater
effect on food’s different emissions, reducing them
by 61 to 73% [see supplementary text (17) for diet
compositions and sensitivity analyses and fig.
S14 for alternative scenarios].

Consumers can play another important role by 
avoiding high-impact producers. We consider a 
second scenario where consumption of each ani- 
mal product is halved by replacing production 
with above-median GHG emissions with vegeta- 
ble equivalents. This achieves 73% of the previous 
scenario’s GHG reduction and 67, 64, and 55% 
of the land use, acidification, and eutrophication 
reductions. Further, lowering consumption of 
more discretionary products (oils, sugar, alcohol, 
and stimulants) by 20% by avoiding production 
with the highest land use reduces the land use of 
these products by 39% on average. For emissions, 
the reductions are 31 to 46%, and for scarcity- 
weighted freshwater withdrawals, 87%. 
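
To make the second scenario's scale concrete, the quoted fractions can be applied to the central estimates of the first scenario. The sketch below is plain arithmetic on numbers stated in the text; the dictionary keys and the function name `second_scenario` are hypothetical, and the results are derived here rather than reported by the study.

```python
# Central reductions for the first scenario (diet excluding animal products),
# as quoted in the text.
full_scenario = {
    "land_use_billion_ha": 3.1,
    "ghg_billion_t_co2eq": 6.6,
    "acidification_pct": 50.0,
    "eutrophication_pct": 49.0,
}

# Fraction of each reduction achieved by the second scenario, as quoted.
achieved_share = {
    "land_use_billion_ha": 0.67,
    "ghg_billion_t_co2eq": 0.73,
    "acidification_pct": 0.64,
    "eutrophication_pct": 0.55,
}


def second_scenario(full, share):
    """Scale each first-scenario reduction by the fraction achieved."""
    return {key: full[key] * share[key] for key in full}


reductions = second_scenario(full_scenario, achieved_share)
```

Under these inputs, halving animal product consumption by removing above-median producers corresponds to roughly 4.8 billion metric tons of CO2eq and 2.1 billion ha; the uncertainty ranges given for the first scenario are ignored here.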

Communicating average product impacts to 
consumers enables dietary change and should 
be pursued. Though dietary change is realistic 
for any individual, widespread behavioral change 
will be hard to achieve in the narrow timeframe 
remaining to limit global warming and prevent 
further, irreversible biodiversity loss. Communi- 
cating producer impacts allows access to the 
second scenario, which multiplies the effects of 
smaller consumer changes. 


An integrated mitigation framework 


In Fig. 4 we illustrate a potential framework im- 
plied by our findings, prior research, and emerg- 
ing policy (9). First, producers would monitor 
their impacts using digital tools (36). Data would 
be validated against known ranges for each value 
(e.g., maximum yields given inputs) and validated 
or certified independently. In the United States 
these tools have already been integrated with ex- 
isting farm software (31); in Africa and South Asia 
they are in trials with 2G mobile phones (37); and 
in China they have been operated by extension 
services with extremely successful results (24). 

Second, policy-makers would set targets on 
environmental indicators and incentivize them 
by providing producers with credit or tax breaks 
or by reallocating agricultural subsidies that now
exceed half a trillion dollars a year worldwide
(38). Third, the assessment tools would provide 
multiple mitigation and productivity enhancement 
options to producers. Ideally these tools would be- 
come platforms that consolidate the vast amounts 
of research conducted by scientists around the 
world, while also sharing producer best practices. 
In particular, practice sharing offers a very effec- 
tive way to engage producers (24). Maximum 
flexibility also ensures least-cost mitigation (39) 
and supports producer-led innovation (24). 

Finally, impacts would be communicated up the 
supply chain and through to consumers. For com- 
modity crops that are hard to trace (37), this may 
not be feasible and mitigation efforts may have to 
focus on producers. For animal products, stringent 
traceability is already required in many countries 
(40), suggesting that communicating impacts is 
most feasible where it matters the most. Commu- 
nication could occur through a combination of en- 
vironmental labels, taxes or subsidies designed to 
reflect environmental costs in product prices (35), 
and broader education on the true cost of food. 

We have consolidated information on the prac- 
tices and impacts of a wide range of producers. 
From this research, we have provided a unified 
exposition of the environmental science for mak- 
ing major changes to the food system. We hope 
this stimulates progress in this crucially impor-
tant area.


Dunes on Pluto


The surface of Pluto is more geologically diverse and dynamic than had been expected,
but the role of its tenuous atmosphere in shaping the landscape remains unclear. We 
describe observations from the New Horizons spacecraft of regularly spaced, linear ridges 
whose morphology, distribution, and orientation are consistent with being transverse 
dunes. These are located close to mountainous regions and are orthogonal to nearby wind 
streaks. We demonstrate that the wavelength of the dunes (~0.4 to 1 kilometer) is best 
explained by the deposition of sand-sized (~200 to ~300 micrometer) particles of methane 
ice in moderate winds (<10 meters per second). The undisturbed morphology of the dunes, 
and relationships with the underlying convective glacial ice, imply that the dunes have
formed in the very recent geological past.


Dunes require a supply of particulate ma-
terial on a surface and a fluid boundary
layer to entrain the grains (i.e., wind, for
dunes on a planet’s surface). They have
been identified in some surprising loca-
tions: Contrary to predictions (1), Saturn’s moon
Titan has a broad belt of linear dunes encircling 
its equatorial latitudes (2), and despite the lack 
of a persistent atmosphere, eolian landforms 
(i.e., those related to wind) have also been 
suggested to occur on comet 67P/Churyumov- 
Gerasimenko (3). On 14 July 2015, NASA’s New 
Horizons spacecraft flew past Pluto, which pro- 
vided spectral data and imagery of the surface 
at resolutions as detailed as 80 m/pixel (4). The 
combination of Pluto’s low gravity (0.62 m s⁻²,
or 1/16 that of Earth), sparse atmosphere [1 Pa (5)],
extreme cold [~45 K (5)], and surface com-
position [N2, CO, H2O, and CH4 ices (6)] made
pre-encounter predictions of surface processes 
challenging. However, pre-encounter speculation 
included that eolian processes, and potentially 
dunes, might be found on Pluto (7), because, despite 
the relatively thin atmosphere, the winds could
possibly sustain saltation (i.e., particle movement
by ballistic hops) in the current surface conditions. 
We examined images from the Long Range Recon- 
naissance Imager (LORRI) instrument (8) on New 
Horizons, taken during the probe’s closest ap- 
proach to Pluto, to search for landforms with the 
morphological and distributional characteristics 
of dunes. We also searched spectroscopic data 
from the Multispectral Visible Imaging Camera 
[MVIC (9)] for evidence of sufficient sand-sized ice 
particles to form dunes, and discuss how subli- 
mation may play a role in lofting these particles, 
enabling them to be saltated into dunes. 


Observations from New Horizons 


The surface of Pluto, as revealed by New Horizons, 
is diverse in its range of landforms, composi-
tion, and age (4, 10). One of the largest features,
Sputnik Planitia (SP), is a plain of N2, CO, and
CH4 ice [(6) and fig. S1] that extends across
Pluto’s tropics and at its widest point covers 
30° of longitude (Fig. 1A). Polygonal features on 
the surface of SP, tens of kilometers across and 
bounded by trenches up to 100 m deep (Fig. 1, B 
and C) have been interpreted as the result of 
thermally driven, convective overturning of the 
ice (11, 12), which, together with the uncratered 
surface of SP (4), suggests a geologically young 
[<500 thousand years (Ka)] (11, 12) and active 
surface. Much of the western edge of the ice is 
bounded by the Al-Idrisi Montes (AIM), a moun- 
tainous region with relief of up to 5 km. On the 
SP plain bordering these mountains, distinct, 
regularly spaced, linear ridges are evident within 
a belt of ~75 km from the mountain margin 
(Fig. 2A). They have positive relief as evident 
from shadows consistent with the mountains. 
The ridges show pronounced spatial regularity 
(~0.4 to 1 km wavelength), substantial length/ 
width ratios (sometimes >20 km length), con- 
sistent shape along these lengths, and the pres- 
ence of merging and bifurcation junctions (Figs. 1, 
C and D, and 2, D and E). These junctions are
approximately evenly spread between 47 north-
facing bifurcations and 42 south-facing splits, and
there is no clear spatial patterning to the direction 
of junctions. Farther from the mountain margin, 
toward the southeast, the ridges become more 
widely spaced and generally larger, while still 
in isolated fields or patches. Dark streaks are also 
found across the surface of the ice, typically be- 
hind topographic obstacles, and have been inter- 
preted as wind streaks (4). These features indicate 
that there are loose particles near and on the sur- 
face, as the streaks are thought to result from the 
deposition of suspended, fine particles in the lee of 
obstacles to wind flow (4, 5, 13) (Fig. 1, C and E).

We have identified 357 pale-colored, linear 
ridges on SP adjacent to the AIM (Fig. 2, A to C), 
as well as six darker wind streaks in addition to 
the seven previously identified (4). The ridges 
closest to the SP/AIM mountain front are oriented 
approximately parallel with it, and ridges farther 
to the southeast shift orientation clockwise by 
~30° over a distance of ~75 km (Fig. 2, A and B);
the ridges farther from the SP/AIM margin are 
significantly (Mann-Whitney U = -7.41; p < 
0.0001) more widely spaced (Fig. 2C). Beyond 
the ~75-km-wide belt in which the linear ridges 
are predominantly found, the morphology of the 
surface changes, with preferential alignment of 
the ridges gradually disappearing (fig. S2), until 
the landscape is dominated by weakly aligned or 
unaligned, but still regularly dispersed, pits likely 
caused by sublimation of the ice (14). Wind streaks
adjacent to the SP/AIM border are perpendicular 
to the ridges and mimic the shift in orientation 
shown by the ridges (Fig. 3, A and B). Streaks 
within the zone in which the ridges are found 
(i.e., <75 km from the SP/AIM border) are geo-
graphically (i.e., clockwise from north) oriented
113 ± 4° (1 standard deviation, σ, with sample
number n = 4), while more distant wind streaks
are oriented significantly [heteroscedastic Student’s
t = 9.912; p < 0.001 (Fig. 3B)] differently at 153 ±
10° (1σ, n = 9).
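
As a rough plausibility check on the reported test statistic, the sketch below recomputes a heteroscedastic (Welch) t value from the rounded summary statistics quoted above (113 ± 4°, n = 4, and 153 ± 10°, n = 9). It is only an illustration: it assumes the orientations can be treated as ordinary linear values over this narrow range, and because it starts from rounded means and standard deviations rather than the individual streak measurements, it should land near, not exactly on, the published t = 9.912. The function name is hypothetical.

```python
import math


def welch_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Welch's unequal-variance t statistic and its approximate degrees of freedom."""
    var_a = sd_a ** 2 / n_a  # squared standard error of sample A
    var_b = sd_b ** 2 / n_b  # squared standard error of sample B
    t = (mean_b - mean_a) / math.sqrt(var_a + var_b)
    # Welch-Satterthwaite approximation for the degrees of freedom
    dof = (var_a + var_b) ** 2 / (
        var_a ** 2 / (n_a - 1) + var_b ** 2 / (n_b - 1)
    )
    return t, dof


# Rounded summary values quoted in the text (degrees clockwise from north).
t_stat, dof = welch_t(113.0, 4.0, 4, 153.0, 10.0, 9)
print(f"t = {t_stat:.2f}, approximate degrees of freedom = {dof:.1f}")
```

The recomputed value comes out close to 10, which is in line with the reported statistic given the rounding of the inputs.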


Interpretation as dunes 


The ridges found on western SP have morpho- 
logical similarities to dunes (Figs. 1, C and D, 
and 4, A to C). In addition to analog similarities, 
we argue that these landforms are most con- 
sistent with an initial eolian depositional origin 
(i.e., dunes) on the grounds that (i) a depositional 
origin is favored by the superimposition of many 
of the dunes on the trenches bordering SP’s con-
vective cells (Figs. 1D and 2, D and E); (ii) the
distribution of the dunes with pattern coarsen- 
ing (enlarging toward the southeast), away from 
the mountains (Fig. 4C), is characteristic of dune- 
fields; (iii) their orientation, and systematic 
regional changes to this orientation, are more 
readily explained by the wind regime than varia- 
tions in incoming solar radiation; (iv) the pres- 
ence of pronounced wind streaks, orthogonal 
to the dunes, demonstrates the potential efficacy 
of Pluto’s winds; (v) their location, on a methane- 
and nitrogen-dominated ice cap adjacent to 
mountains, is where the strongest winds and 
a supply of sediment might be expected; and 




(vi) their differing morphologies and undeformed 
regular alignment differ from the randomly 
aligned, shallow pits that border on the dune 
regions of SP (Fig. 4E and fig. S2) and the deeply 
incised, discrete, aligned pits that can be found 
toward SP’s southern and eastern margins (Fig. 
4D). These pit-like features are morphologically 
distinct from the dune-like ridges farther north 
near the AIM that we discuss here (Fig. 4, A and 
D). To test this hypothesis, we use a model (15) to 
examine the saltation of sand-sized (in this case,
~200 to 300 µm) particles on Pluto. The model
indicates that saltation, once initiated, can be sustained
even under the low (Earth-like; 1 to 10 m s⁻¹) wind
speeds predicted at the surface today (16). How-
ever, the model also suggests that an additional
process may be necessary to initially loft particles 
(15). This can be accomplished by sublimation, 
which is capable of lofting particles, and we mod- 
el this process to find that particles can be en- 
trained. This function of sublimation is in addition 
to the role sublimation may play in eroding mature 
dunes to more altered forms, which is also discussed 
in more detail below. Thus, under the current con- 
ditions, if there are sufficiently noncohesive sand- 
sized particulates on the surface of Pluto, we should 
expect to find dunes. 

Terrestrial and planetary dunes that form 
straight ridges can occur either perpendicular to 
the wind, forming transverse dunes, or parallel 
to the net local wind regime, forming longitudinal 
(or linear) dunes, and regional variation in the 
alignment of such dunes on Earth is typically 
associated with meso- or large-scale atmospheric 
patterns (17). Wind streaks are well known on
Venus and Mars, are present even under the
tenuous atmosphere of Triton, and are generally
considered to represent the wind direction (13).
The presence of pronounced wind streaks (Fig. 1C) 
within the dunefield, very nearly orthogonal to 
the dune trends, suggests that the observed dunes 
are transverse forms (Fig. 3A). The transverse 
nature of the dunes is further supported by the 
lack of consistency in bifurcation orientation; 
within dunefields oriented parallel to net sediment- 
transporting winds, such defects tend to cluster 
in terms of their orientation (18). The transverse
orientation also shows that these ridges cannot be 
sastrugi (erosional snow ridges that form parallel 
to net winds) (19), or other erosional features anal- 
ogous to yardangs (wind-carved ridges). The dunes 
in the northwestern portion of SP/AIM (Fig. 4A) 
are even more regularly spaced and parallel than 
many transverse dunes on Earth. Possible explan- 
ations for this include a highly consistent wind 
regime, lack of topographic deflection of winds, 
or a smooth substrate. 
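
The orthogonality argument can be illustrated with a small calculation. The sketch below takes the mean dune trend nearest the wind streaks (204°, from the Fig. 2 caption) and the mean near-margin streak orientation (113°), treats both as undirected lineations, and reports the acute angle between them. The function names and the 75° and 15° classification cutoffs are illustrative assumptions, not values from the analysis.

```python
def axial_difference(trend_a_deg, trend_b_deg):
    """Smallest angle (0 to 90 degrees) between two undirected lineations."""
    diff = abs(trend_a_deg - trend_b_deg) % 180.0
    return min(diff, 180.0 - diff)


def classify_dune(angle_to_wind_deg, transverse_min=75.0, longitudinal_max=15.0):
    """Label a straight-crested dune by its angle to the wind.

    The two cutoffs are hypothetical values chosen only for this sketch.
    """
    if angle_to_wind_deg >= transverse_min:
        return "transverse"
    if angle_to_wind_deg <= longitudinal_max:
        return "longitudinal"
    return "oblique"


dune_trend = 204.0    # mean crest orientation near the streaks (degrees)
streak_trend = 113.0  # mean wind-streak orientation near the margin (degrees)

angle = axial_difference(dune_trend, streak_trend)
print(angle, classify_dune(angle))
```

With these inputs the crests sit within about a degree of perpendicular to the streak-inferred wind, which is the basis for reading them as transverse forms.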


Conditions for the formation of dunes 
and sublimation pits 


The existence of dunes on the surface of Pluto
requires three criteria to be met. First,
there must be a fluid atmosphere of sufficient 
density to make eolian transport possible. Sec- 
ond, there must be a granular material of a size 
and density, and with sufficiently low cohesion, 
that it can be entrained by winds. On Earth, this 
role is typically played by sand-sized mineral
grains of a variety of compositions, including 
snow and ice. Third, given the high wind speeds 
needed to lift surface particles against cohesion 
forces (20) (Fig. 5), a specific mechanism must 
exist to loft large quantities of ice particles into 
the atmosphere where they are available for eolian 
transport. The presence of these criteria alone is 
necessary but not sufficient to identify the surface 
features as dunes. To justify our interpretation of 
these features as being dunes, we also examine the 
conditions required for the other most likely 
candidate: aligned sublimation pits. 


Winds 


The orientations of the dunes and the wind 
streaks change locally, and consistently; in the 
case of the dunes, over a distance on the order 
of 10 to 10² km. This implies that the topography
and/or surface composition has influenced the
local wind regime, as was anticipated (21). These
orientations are consistent with sublimation- 
driven and topographic mechanisms for the 
horizontal displacement of the atmosphere, as 
winds are generated by a gravity-driven flow 
toward lower regions. Modeling of Pluto’s cur- 
rent atmosphere suggests that surface winds 
on the order of 1 to 10 m s⁻¹ are possible; they
should be strongest where there are topographic
gradients and when driven by sublimation of
surface ices by sunlight (15). The location of the
dunes at the western margins of the SP and AIM 
should thus be among the windiest locations 
on the known regions of Pluto. As with Earth and 
Mars, once grain transport along the surface of 
Pluto has begun, increased efficacy of grain-splash 
(the ejection of new particles due to grains in 
saltation colliding with the ground) promotes a 
hysteretic effect that further sustains sediment 
flux (22, 23). We use a numerical model (15) to
demonstrate that despite the high wind speeds
needed for initial eolian entrainment, eolian
transport can, once established, be sustained
with wind speeds of ~10 m s⁻¹ (Fig. 5).

Fig. 1. New Horizons flyby imagery of landforms attributed to eolian origins. All images are
unrectified, and thus all scales are approximate. Color-composite MVIC images are shown here for
context; dune identification was performed on grayscale LORRI images (shown below). (A) Overview
of Pluto centered on ~25° latitude, ~165° longitude, showing the locations of images (B) and (E)
and Fig. 3A and fig. S3 (47). (B) The spatial context for SP and the AIM mountains to the west (48).
Insets (C) and (D) show details of the highly regular spatial patterning, which we attribute to
eolian dune formation, and two newly identified wind streaks (arrows x), along the margins of the
SP/AIM border. Here the dunes show characteristic bifurcations (arrows y) and a superposition with
SP’s polygonal patterning (arrow z), suggesting a youthful age for these features (49). (E) Two
further wind streaks on the surface (x’), downwind of the Coleta de Dados Colles (4). These wind
streaks, farther from the SP/AIM margin, are oriented differently than those close to the icefield’s
edge and are still roughly orthogonal to the dunes there (50).


Sediment supply 


Although terrestrial dunes are typically associated 
with quartz, basalt, or gypsum sand, other ma- 
terials can form the grains for dune development. 
Snow dunes of a very large scale are observed 
in the center of the Antarctic continent (24); on 
Titan, it is generally assumed to be atmosphere- 
derived organics, perhaps initially tholins, which 
form the equatorial belt of giant dunes (25). 
Whereas tholins are thought to form the dark 
patches of Pluto’s equatorial regions (6), the 
dunes evident on SP are light in color and are 
thus not formed from the same complex, organic, 
photochemically derived haze seen in Pluto’s 
atmosphere (4). The most likely candidates are
thus N2 and CH4 ices. The surface of SP has
generally been interpreted as predominantly
composed of N2 ice (4, 5), just as solidified ni-
trogen snows are believed to account for Triton’s
ice-covered surface (26, 27). The zone in which
the ridges occur is coincident with the latitudes
in which net N2 condensation occurs over the
course of a Pluto year (16, 28) (fig. S3). However,
recent analyses suggest that the composition
may be a more complex mix of N2, CH4, and CO
ices (29). Our analysis of data from the MVIC
instrument, using a CH4 filter (15), suggests that




the location of the ridges and streaks coincides 
with a region of enhanced CH4 ice content (fig.
S4). To the west of SP, the Enrique Montes in 
Cthulhu Macula (CM) have been shown to be 
capped with methane, presumably as the result 
of condensation or precipitation (29, 30). CH4 
ice retains hardness and rigidity under Pluto 
surface conditions, which is ideal for saltation 
and dune formation, whereas N2 ice is likely to
be softer. These constraints lead us to conclude 
that the dunes are formed predominantly of 
grains of methane ice, though we do not rule 
out that there could also be a nitrogen ice com- 
ponent. The presence of transverse forms, in- 
dicating sediment-rich local conditions, as 
opposed to more sediment-starved isolated 
barchans (discrete, crescentic dunes), suggests 
that locally, the sediment supply to this region of 
SP must be, or must have been, abundant. Given 
the strength of the color and boundary delinea- 
tions of methane in the AIM (fig. S3), the meth- 
ane ice may be quite thick and perhaps similar 
to valley glaciers in these isolated regions. If 
such high-altitude methane snowpack is a re- 
gular, seasonal occurrence, this may be a substan- 
tial reservoir from which to derive the abundant 
sand across the northwestern surface of SP re- 
quired to form these transverse dunes. 


Grain size 


Credible sediment sizes are required for dune 
formation under the likely eolian regime. The 


grain sizes proposed for nitrogen ices (e.g., on 
Triton) have varied from micrometer (31, 32) 
to meter scale (33). We develop a method (15) to
approximately constrain average grain size (d)
and formative wind speed (U) from the mean
crest-to-crest distance, or wavelength (λ), of the
transverse dunes. For eolian dunes, the relevant
length scale controlling this wavelength is the
saturation length (L_sat) of the sediment flux,
which is the distance needed by the flux to
adapt to a change in local flow conditions. By
combining theory (34), which predicts L_sat as
a function of wind speed and attributes of sedi-
ment and atmosphere, with a mathematical
model (35) for λ as a function of L_sat and U,
we obtain the values of d and U that are con-
sistent with λ. These values are shown in fig. S5,
for λ = 700 m and λ = 560 m, which correspond
to the transverse dunes far from and near to the
mountainous area of Fig. 1, respectively. Given
that expected formative wind speeds on Pluto
are not larger than 10 m s⁻¹ (16), fig. S5 implies
that grain size does not exceed ~370 µm and is
most probably in the range between 210 and
310 µm. The spectral response of the MVIC CH4
filter offers an additional constraint on the pos-
sible grain sizes observed, as Hapke modeling
of the scattering within a granular medium
provides a grain size-dependent control on the
equivalent width of the absorption band. We
find (15) that the observed response is consistent
with a granular medium of ~200 to 300 µm.




Fig. 2. Identified features. (A) Dunes (black lines) at the margins of 
western Sputnik Planitia. Prominent wind streaks are marked with orange 
lines. (B) Radial plot of the orientation of the dunes (n = 331), and the 
direction orthogonal to the wind implied by the wind streaks close to the 
SP/AIM margin (orange dashed line; n = 4; arithmetic mean, x̄ = 203°).
Because the dunes have a distinct shift in orientation (fig. S1), the 
distribution of dunes in the three patches closest to the wind streaks 
within the dunefield [outlined in dashed green in (A)] has been separately 
highlighted on the radial plot, in green. These have a mean orientation of 
204° (n = 77), highlighted by the dashed green line. The dark blue line 
indicates the mean trend of the border of SP and the Al-Idrisi Montes
in this area (194°). (C) Frequency of dune spacings in clusters close to [red 
line representing dunes within the red dashed line of (A)] and far from 
[green line representing dunes within the green dashed line of (A)] the 
icefield/mountain interface. Dunes farther from the mountains are more 
widely spaced (x̄ = 700 m) than those close to the mountains (x̄ = 560 m).
(D) Detail of the image interpretation process of the highest-resolution 
swath, showing linear ridges, which sometimes bifurcate but are 
otherwise notable for their regularity. (E) The same image with ridge 
lines highlighted. 
Lofting

Although eolian transport can be maintained 
under Pluto’s current wind regime, the speeds 
necessary for initial entrainment are orders of 
magnitude greater than those believed to be 
present at Pluto’s surface (Fig. 5). An additional 
process is thus likely to be necessary to initiate 
eolian activity. In Sputnik Planitia, this process 
may be related to the intense, solar-driven sub- 
limation of surface ices that injects more than 
10? m® m”? of gas into the atmosphere every 
afternoon [figure 8 of (16)]. When sunlight 
penetrates through, the upper layers of semi- 
transparent ice particles are lofted, sometimes 
at high vertical velocities, due to a mechanism 
often referred to as a solid-state greenhouse 
(36). Therefore, initial entrainment of ice grains 
that eventually form dunes may result from sub- 
limating subsurface ice, as has been observed in the 
thin atmospheres of Mars’ northern polar region 
(37) and proposed for comet 67P/Churyumov-
Gerasimenko (38, 39) and Triton (24, 25). Model-
ing (15) suggests that subsurface N2 sublimation
under Pluto surface conditions is capable of
lofting even the densest candidate particles [N2
ice, at 1030 kg m⁻³, is denser than CH4 ice, at
494 kg m⁻³ (40)] with sizes ~200 µm, even at
0.1 Pa; within the range of both Pluto’s atmo-
spheric pressure and the solid N2 vapor pressure.
Surface ices of mixed composition offer an ad- 
ditional potential mechanism for facilitating grain 
lofting. At the nitrogen frost point temperature 
of 63 K (11), pure methane ice particles mixed 
with nitrogen should not sublimate at all. As 
methane particles are slightly heated by the 
Sun, they should enable the sublimation of the 
nitrogen ice that they touch and thus be readily 
lofted into the atmosphere. Similar processes 
have recently been suggested for the migration 
of tholin deposits on the surface of Pluto (41).
Past periods of higher atmospheric pressures, 
which have been suggested (42), could facilitate 
initial entrainment due to increased efficacy of 
eolian processes. 


Sublimation 


The landscape of SP contains evidence of 
sublimation-driven landforms (4, 7, 14), and 
this process is important in shaping parts of 
Pluto’s surface. We consider whether the land- 
forms described here are more consistent with 
origins attributable to eolian or sublimation pro- 
cesses. Locally, sublimation pits are deeply incised 
and may align to form linear troughs up to tens 
of kilometers long and up to ~1 km deep (4, 7). 
Frequently, and especially toward the southern 
and eastern margins of SP, any alignment is sub- 
sequently heavily deformed, presumably driven by 
glacial flow and convective overturning. Analog 
landforms on Earth are provided by sublimation- 
driven textures of snow and ice surfaces: ablation 
hollows (suncups) and penitentes (14, 43). On Earth, 
ablation hollows on snow may become aligned to 
leave ridges (penitentes), which align themselves 
to within +30° of east-west (i.e., the annual mean 
net Sun path) (44). Although the orientation of any 
penitentes on Pluto is likely to be more complex 
and seasonally dependent, they are only likely to
form wavelengths in excess of ~1000 m (45). It is
possible that sublimation has acted upon already
formed dunes in some regions (Fig. 2A). In polar
regions on Earth, wind-driven snow or ice grains
can produce dunes, which then become hardened
by sintering and begin to undergo modification
by wind and sublimation processes, thus changing
from depositional to erosional landforms (44, 45).
Given the tendency of ices to sinter together un-
der the right conditions, this could also happen in
the CH4 or N2 ices of Pluto’s dunes. Sublimation
erosion of Pluto’s dunes may enlarge and round
the areas between the dunes, and sharpen the
dune crests while preserving the overall dune ori-
entation and spacing. This morphology may be
seen just at the resolution limit in the features
farthest from the mountains in Fig. 2A (enlarged
view in fig. S2). This is supported by modeling (5)
of the net accumulation of ices across Pluto’s sur-
face during the past two (Earth) centuries (fig. S4),
which suggests that for the past ~30 Earth years,
the dunefield has been experiencing net sublima-
tion. Some of these features may have progressed
so far toward being erosional that we have not
identified them as dunes (fig. S2).

Fig. 3. The western margin of Sputnik Planitia. (A) Transverse dunes are shown in black, the margin
of the icefield and neighboring Al-Idrisi Montes to the northwest in blue, wind streaks close to this margin in
orange, and further wind streaks farther from the mountains in yellow. There is an orientation shift
between the two sets of wind streaks, matching the correlation between the distance to the margin of the
icefield and mountains, and the orientation of the transverse dunes [(B); wind streaks in orange]. We
interpret this as topography and/or surface composition influencing regional wind regimes.


Age 

An upper limit on the age of the dunes, which 
sit atop the ice of the western margins of SP, 
is imposed by the recycling rate of the upper 
surface of the convectional cells within the ice 
(i.e., <500 ka) (11, 12). This overturning of the 
substrate, inferred from the complete absence 
of identified craters on SP, provides an age 
constraint for superficial landforms that is not 
available for dunes on other solar system bodies 
and implies a geologically and/or geomorpho- 
logically active surface (4, 10, 46). Surface fea- 
tures, undistorted by the convectional overturning 
within the ice, must be much younger than the 
time scales of convection, and therefore closer 
to the time scales of Pluto’s strong seasons (i.e., 
terrestrial decades to centuries). Further evidence 
that the dunes form on a time scale substantially
shorter than that of the convection is suggested
by the superposition of the dunes over the de-
pressions at the cell margins (Fig. 1G).

Fig. 4. Analogs and comparison with sublimation features. (A) Details of the dunes on western
Sputnik Planitia, centered on 34.35° 159.84° (location shown in Fig. 1). (B) Analogous terrestrial transverse
dunes of the Taklamakan Desert, western China [Image credit: image CNRS/SPOT, DigitalGlobe and
courtesy of Google Earth], and (C) the same location down-sampled to a relative resolution similar to that of
the Pluto dunes (i.e., ~5 to 10 pixels per crest-crest spacing). (D) The aligned and distorted sublimation
features abundant on southern and eastern SP (image centered on −4.78° 189.48°) and (E) weakly aligned to
randomly oriented, shallow sublimation pits. (F) An example of a landscape revealing both eolian
and sublimation-derived landforms at Mars’ southern polar ice cap from the Mars Reconnaissance Orbiter
reveals both dark eolian bedforms (dunes and ripples), as well as sublimation pits developing in
underlying CO2 ice. [Image credit: NASA/JPL/University of Arizona, ESP_014342_0930_RED]


Summary and conclusions 


We have presented evidence that the highlands 
adjacent to SP accumulate methane. The ridged, 
dune-like landforms nearby, and accompanying 
wind streaks, are rich in methane relative to 
their underlying substrate. Although the wind 
speeds needed for eolian entrainment are higher 
than the likely wind speeds present on the sur- 
face, sublimation provides a credible mechanism 
for lofting grains. Numerical sediment transport 
and spectral modeling suggest that these methane 
grains are ~200 to 300 µm. Our models suggest
that eolian transport is highly effective under 
Pluto surface conditions once initiated. An ample 
sediment supply appears to be available from a 
seasonally abundant snowpack in the adjacent 
mountains. The result is the formation of trans- 
verse dunes, as we identify in the images from 
New Horizons. The orientation of the dunes per- 
pendicular to the wind is supported by the local 
topography and surface, and accompanying wind 
streaks. The presence of these dunes indicates 
an active atmosphere that produces geologically 
young landforms. 
A bioinspired flexible organic
artificial afferent nerve
Simiao Niu,° Jia Liu,® Raphael Pfattner,® Zhenan Bao,°+ Tae-Woo Lee*+


The distributed network of receptors, neurons, and synapses in the somatosensory 
system efficiently processes complex tactile information. We used flexible organic 
electronics to mimic the functions of a sensory nerve. Our artificial afferent nerve collects 
pressure information (1 to 80 kilopascals) from clusters of pressure sensors, converts
the pressure information into action potentials (0 to 100 hertz) by using ring oscillators,
and integrates the action potentials from multiple ring oscillators with a synaptic transistor.
Biomimetic hierarchical structures can detect movement of an object, combine
simultaneous pressure inputs, and distinguish braille characters. Furthermore, we
connected our artificial afferent nerve to motor nerves to construct a hybrid bioelectronic
reflex arc to actuate muscles. Our system has potential applications in neurorobotics
and neuroprosthetics.


Whereas classical von Neumann-based
computing systems rely on centralized
and sequential operations determined
by a clock, neurons and synapses in
biological nervous systems process in-
formation on the basis of distributed, parallel,
and event-driven computation (1). As a result,
classical systems exhibit the advantages of high 
speed and accuracy for well-defined problems, 
and biological systems are compact, fault toler- 
ant, and power efficient for complex real-world 
problems, such as visual information processing, 
speech recognition, and movement control. Bio- 
logical systems have influenced many fields of 
science and engineering, such as neuromorphic 
computing (2-5), bioinspired sensing systems 
(6-9), control theory for legged robots (10, 11),
and prosthetics (12-14). 

Biologically inspired systems have been imple- 
mented at the software level in classical von 
Neumann-based systems to distinguish braille 
characters (15) and control actuators in legged
robots (10, 11). Alternatively, complex silicon cir-
cuits have been developed to mimic the spike-
based information processing in biological systems,
in which the function of a synapse was emulated 
by several conventional silicon transistors and a
capacitor (16). 

Device structures that emulate the functional- 
ity and signal processing of biological compo- 
nents may potentially simplify complex circuits 
by mimicking multiple synapses with a single 
device. Organic devices are attractive because 
their characteristics can be tuned through chem- 
ical design (17, 18), they are compatible with 
printing methods that enable large-area cover-
age at a low cost (13), and they have relatively low
elastic moduli, similar to those of soft biological 
systems. 

The development of neuromorphic systems 
that mimic the sense of touch can benefit from 
the improved understanding of information pro- 
cessing in somatosensory peripheral nerves. 
The branched structure of slowly adapting type I 
(SA-I) afferent (sensory) neurons, which leads 
to a complex receptive field with many hotspots, is 
critical for sensing at a spatial resolution smaller 
than the spacing between the receptive field cen- 
ters (fig. S1) (19), discerning the movement of ob- 
jects, and distinguishing the orientation of edges 
on objects (19). The interneurons in the spinal cord
have also been found to form synapses with multi- 
ple afferent neurons to encode and separate input 
features before tactile information is delivered 
to the cortex (20, 21). 

Here we describe an artificial afferent nerve 
based on flexible organic electronics (22-25). 
The bioinspired artificial afferent nerve emu- 
lates the functions of biological SA-I afferent 
nerves (Fig. 1A) by collecting data from multiple 
tactile receptors and conveying this information 
to biological efferent (motor) nerves, completing 
a hybrid bioelectronic reflex arc. Our artificial 
afferent nerve (Fig. 1, B and C, and fig. S2) con-
sists of three core components: resistive pressure
sensors, organic ring oscillators, and a synaptic
transistor. Artificial mechanoreceptors consisting 
of a cluster of pressure sensors (each pressure 
sensor corresponds to a hotspot in the receptive 
field) are connected to an artificial nerve fiber 
(a ring oscillator) that converts external tactile 
stimuli into voltage pulses. The electrical signals 
from multiple artificial nerve fibers are then in- 
tegrated and converted into postsynaptic currents 
by a synaptic transistor. The synaptic transistor 
can be subsequently used to interface with bio- 
logical efferent nerves to form a complete mono- 
synaptic reflex arc. 

The resistive pressure sensors (13), with a 
sensitivity and working range comparable to 
those of biological receptors, are composed of 
a conducting pyramid-structured elastomer that 
forms a resistive pathway between CNT (carbon 
nanotube) and Au electrodes (Fig. 1B and fig. S3). 
An increase in pressure increases the contact 
area and therefore decreases the resistance be- 
tween the CNT electrode and the Au electrodes 
(Fig. 2A). 

The organic ring oscillator (figs. S4 and S5) was
made of odd numbers of pseudo-complementary
metal-oxide semiconductor (CMOS) inverters
(figs. S6 and S7). The ring oscillator was de-
signed to oscillate at frequencies that match
the action potentials of sensory neurons (0 to 
100 Hz) (supplementary text) (26). Frequency- 
encoded information can be more robust to volt- 
age degradation and parasitic resistances than 
amplitude-encoded information. The supply volt- 
age to the ring oscillator increased with the 
pressure on the sensor (Fig. 2A). The oscillation 
of the ring oscillator is “off” and the power con-
sumption decreases (though not to 0 W because
of the pseudo-CMOS design) when there is no 
pressure on the sensor. A constant nonzero pres- 
sure input leads to a fixed supply voltage to the 
ring oscillator, resulting in a constant frequency 
output that can be easily used to calculate the 
pressure input on the basis of a 1-to-1 relation- 
ship between pressure inputs and concurrent 
ring oscillator outputs, but this constant input 
also increases power consumption compared 
with that of biological SA-I afferent nerves, which 
have slow sensory adaption (supplementary text). 
An increase in pressure intensity resulted in an 
increase in both the frequency and the peak am- 
plitude of electrical impulses from the oscillator 
(Fig. 2B). This is slightly different from the case 
in biology, in which the amplitudes of action 
potentials are usually the same (supplemen- 
tary text). 
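
To make the link between inverter count and output frequency concrete, the sketch below uses the standard first-order ring-oscillator relation, f = 1 / (2 N t_d), where N is the (odd) number of inverter stages and t_d is the per-stage propagation delay. This is a textbook approximation, not the design procedure used for the device described here; the stage count of 5 is a hypothetical value, and in the real circuit the delay depends on the pressure-dependent supply voltage.

```python
def ring_oscillator_frequency(n_stages, stage_delay_s):
    """First-order oscillation frequency (Hz) of an N-stage inverter ring."""
    if n_stages < 3 or n_stages % 2 == 0:
        raise ValueError("a ring oscillator needs an odd number of stages (>= 3)")
    return 1.0 / (2.0 * n_stages * stage_delay_s)


def required_stage_delay(n_stages, target_hz):
    """Per-stage delay (s) needed to reach target_hz with n_stages inverters."""
    return 1.0 / (2.0 * n_stages * target_hz)


# Hypothetical example: a 5-stage ring tuned to the 100 Hz upper bound
# of the action-potential range quoted in the text.
n_stages = 5
delay = required_stage_delay(n_stages, 100.0)
print(f"stage delay for 100 Hz: {delay * 1e3:.2f} ms")
print(f"check: {ring_oscillator_frequency(n_stages, delay):.1f} Hz")
```

Under this approximation, millisecond-scale stage delays are what place the output in the biological 0 to 100 Hz range, which is far slower than typical silicon ring oscillators.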

The synaptic transistor (17, 27, 28) was fab- 
ricated with the use of a solution-processed con- 
jugated polymer (the compound P1) (fig. S8B) as 
the hole-transporting semiconductor and an ion 
gel as the gate dielectric (figs. S8 to S11). Although 
the synaptic transistor was more limited in its ca- 
pacity for run-time tuning of the temporal dynam- 
ics during operation (fig. S12) than the rather 
complex silicon synapse consisting of several tran-
sistors and a capacitor (16), we were able to
obtain decay times for postsynaptic currents (typ- 
ically 2 to 3 ms) (fig. S13) in a range comparable 
to that of decay times for synapses of biological
afferent nerves (1.5 to 5 ms) (21, 29) by choos- 
ing a specific polymer semiconductor during 
fabrication (table S1). The large variation in de- 
cay times within a material should have minimal 
effect on the output (30). The voltage outputs 
from the ring oscillators were connected to the 
gate electrodes of the synaptic transistor. The 
frequency of the current output from the syn- 
aptic transistor (Fig. 2C) matched the frequency 
of the voltage output from the ring oscillator 
(Fig. 2B). However, the pressure-current out- 
put from the synaptic transistor (Fig. 2C) was 
more linear than the pressure-voltage output 
of the ring oscillator (Fig. 2B). Our artificial af-
ferent neuron consumes ~8 µW in the “off” state
and ~25 µW in the “on” state (fig. S14). An array
of our artificial afferent neurons can lead to a 
flexible artificial afferent nervous system—one of 
spiking neural networks (supplementary text)— 




with lower power consumption than the sys- 
tem consisting of a conventional flexible one- 
transistor-one-resistor pressure sensor array 
connected to Si-integrated circuit chips, which 
consume additional power for readout and 
control (fig. S15 and table S2).

Prolonged presynaptic voltage spikes can 
lead to more anion accumulation near or in 
the organic semiconductor and then increase 
the amplitude of postsynaptic currents (31). 
The peak postsynaptic currents are influenced 
by not only the magnitude of the pressure stim- 
ulus (Fig. 2, C and D) but also the duration of 
the pressure stimulus (Fig. 2, E and F). The du- 
ration of the pressure stimulus does not change 
voltage outputs from the ring oscillator. When 
we applied 500 cycles of 2-s pressure stimula- 
tion every 10 s (fig. S12), we detected slight in- 
creases in the postsynaptic currents and the 
decay times.

The synaptic transistor can combine signals 
from multiple ring oscillators because the ion 
gel allows the active channel to be gated by mul- 
tiple electrodes (Fig. 3A and fig. S10C). The post- 
synaptic currents resulting from the pressures 
simultaneously applied to two pressure sen- 
sors (Fig. 3D) were comparable to the sum of 
the currents resulting from a pressure applied 
to two individual sensors (Fig. 3E = Fig. 3B + 
Fig. 3C). In the Fourier transform of each case 
(Fig. 3F), the synaptic transistor could combine 
signals from two pressure sensors and gener- 
ate postsynaptic currents consisting of two fre- 
quency components corresponding to those 
from the two pressure sensors. Analogous to 
the way in which dendrites of a postsynaptic 
neuron in connection with a number of bio- 
logical synapses add action potentials from 
multiple presynaptic neurons, a single synap- 
tic transistor can add voltage inputs coming 


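[Fig. 1 panel labels. (A) Biological: (i) pressures on mechanoreceptors, (ii) receptor potential changes, (iii) action potentials in the nerve fiber, (iv) postsynaptic potentials. (B) Artificial: (i) pressures on pressure sensors, (ii) voltage changes, (iii) voltage pulses, (iv) postsynaptic currents.]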


Fig. 1. An artificial afferent nerve system in comparison with a 
biological one. (A) A biological afferent nerve that is stimulated by 
pressure. Pressures applied onto mechanoreceptors change the receptor 
potential of each mechanoreceptor. The receptor potentials combine 
and initiate action potentials at the heminode. The nerve fiber forms 
synapses with interneurons in the spinal cord. Action potentials from 
multiple nerve fibers combine through synapses and contribute to
information processing. (B) An artificial afferent nerve made of pressure
sensors, an organic ring oscillator, and a synaptic transistor. Only one 
ring oscillator connected to a synaptic transistor is shown here for 
simplicity. However, multiple ring oscillators with clusters of pressure 
sensors can be connected to one synaptic transistor. The parts with 

the same colors in (A) and (B) correspond to each other. (C) A photograph 
of an artificial afferent nerve system.

from multiple ring oscillators. The synaptic transistor
thus integrates signals more simply than
the traditional synapse circuit made of several
transistors and a capacitor, which needs separate
circuits per input (16), although the decay
time and the synaptic weight of each gate input 
in our synaptic transistor can be individually 
tuned only during fabrication, not during op- 
eration. Compared with uniformly applied pres- 
sures (80 kPa in fig. S16, D and F), pressures of 
different magnitudes (20 and 80 kPa in Fig. 3, D 
and F) led to different temporal and spectral 
patterns of postsynaptic currents. Such patterns 
could be used to encode information about large- 
scale textures measured by SA-I afferents. 
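
The adder behavior and the two-component spectrum described above can be illustrated with a toy numerical sketch. It is not the authors' analysis: the two oscillation frequencies (20 and 60 Hz), the amplitudes, the 1-kHz sampling rate, and the sinusoidal waveform are all assumed values chosen for illustration. The code sums two oscillating inputs, as an idealized synaptic transistor would, and applies a plain discrete Fourier transform normalized to its maximum peak (the normalization mentioned for Fig. 3F) to show that both frequency components survive the summation.

```python
import cmath
import math

FS = 1000  # sampling rate in Hz (assumed)
N = 1000   # 1 s of samples, so DFT bin k corresponds to k Hz


def oscillation(freq_hz, amplitude):
    """Idealized oscillating input; a sinusoid stands in for the real waveform."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * n / FS) for n in range(N)]


def normalized_spectrum(signal, max_bin=100):
    """Magnitude of a naive DFT for bins 0..max_bin, normalized to its maximum peak."""
    mags = []
    for k in range(max_bin + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * n / len(signal))
                  for n, x in enumerate(signal))
        mags.append(abs(acc))
    peak = max(mags)
    return [m / peak for m in mags]


# Hypothetical inputs: a weakly pressed sensor (lower frequency and amplitude)
# and a strongly pressed sensor (higher frequency and amplitude).
low = oscillation(20, 0.5)
high = oscillation(60, 1.0)

# Idealized adder: the combined output is the sample-wise sum of the inputs.
combined = [a + b for a, b in zip(low, high)]

spectrum = normalized_spectrum(combined)
components = [k for k, m in enumerate(spectrum) if m > 0.25]
print(components)  # frequency bins (Hz) that carry substantial power
```

In this idealized case the spectrum of the sum contains exactly the two input frequencies, which is the property the text attributes to the postsynaptic current in Fig. 3F.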

One ring oscillator can combine signals from 
a cluster of pressure sensors, similar to a biolog- 
ical heminode (the beginning of the myelinated 
part of a nerve fiber) (26). When multiple pres- 
sure sensors are connected in parallel (Fig. 3G), 
the highest pressure will have a dominant ef- 
fect on the output of the ring oscillator. This 
behavior mimics the branched structures and 
heminodes of afferent nerves that give rise to 
a sophisticated sense of touch in higher-order 
animals (26). A rodlike object was moved across 
two branches of pressure sensors in two differ- 
ent directions (Fig. 3G). When the object moved 
in the directions of the red and blue arrows, 
the postsynaptic current outputs had one valley

Fig. 2. Characteristics of an artificial afferent nerve system with

one branch. (A) Resistance of a resistive pressure sensor and the 
corresponding change in the supply voltage of an organic ring oscillator 
in response to the change of pressure. The pressure sensor and the 
organic ring oscillator formed a voltage divider between a dc power 
supply voltage and the ground (fig. S2). (B) Peaks and oscillating 
frequencies of output voltages of ring oscillators as a function of pres- 
sures applied to pressure sensors. (C) Peak values and oscillating frequencies 
of postsynaptic currents of synaptic transistors depending on pressures. 
The gate voltage of the synaptic transistor was supplied from the ring

corresponding to the object position between
the two pressure sensors (Fig. 3H) and a monotonic
increase without any local minimum (Fig. 3I),
respectively. Analysis of the temporal profiles
of postsynaptic currents in these two cases
demonstrates the possibility of a bioinspired 
approach to recognize the direction and poten- 
tially the speed of object movement, which can 
be estimated from the duration of the valley of 
postsynaptic currents. 
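
One way to make the valley criterion concrete is sketched below. This is only an illustration under stated assumptions, not the authors' procedure: the function name `valley_duration`, the depth threshold, the sampling interval, and both example traces are made up. A sample counts as part of a valley when it lies well below both the highest earlier sample and the highest later sample, so a monotonic rise yields no valley at all.

```python
def valley_duration(trace, dt, min_depth):
    """Return the total time (s) the trace spends in a valley, or 0.0 if none.

    A sample belongs to a valley when it lies at least `min_depth` below both
    the highest earlier sample and the highest later sample.
    """
    n = len(trace)
    left_max = [0.0] * n
    right_max = [0.0] * n

    running = float("-inf")
    for i in range(n):  # running maximum from the left
        running = max(running, trace[i])
        left_max[i] = running

    running = float("-inf")
    for i in range(n - 1, -1, -1):  # running maximum from the right
        running = max(running, trace[i])
        right_max[i] = running

    in_valley = [min(left_max[i], right_max[i]) - trace[i] >= min_depth
                 for i in range(n)]
    return sum(in_valley) * dt


# Made-up envelopes of the postsynaptic current (arbitrary units), 0.1 s per sample.
with_valley = [0.0, 1.0, 2.0, 2.0, 1.0, 0.5, 0.5, 1.0, 2.0, 2.0, 0.0]
monotonic = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0]

print(valley_duration(with_valley, dt=0.1, min_depth=0.5))
print(valley_duration(monotonic, dt=0.1, min_depth=0.5))
```

A nonzero duration would indicate one movement direction and a zero duration the other. Dividing a known sensor spacing by the valley duration would then give a rough speed estimate, in the spirit of the suggestion in the text.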

We used our artificial afferent nerves to iden- 
tify braille characters pressed on an array of 
three pixels by two pixels (Fig. 3J and fig. S17). 
Six synaptic transistors accepted inputs from one 
oscillator, and 11 synaptic transistors each ac- 
cepted inputs from two different oscillators. The 
peak frequencies of postsynaptic currents from synaptic
transistors connected to one pixel (marked with a single number,
from 1 to 6, and a blue box in fig. S17, A and
B) are shown for all six pixels (Fig. 3K and fig. 
S18A). To quantify the difference in postsynaptic 
responses between different letters, we used 
Victor-Purpura spike train metrics (32) to calculate
the Victor-Purpura distances (D_VP) between
the characters (fig. S18, B and C). A
larger D_VP means more dissimilarity between
two spike trains. Compared with the smallest
D_VP between different letters with the use of
only pressure sensors and ring oscillators (i.e., 
without the signal integration of two pixels),
the synaptic signal integration led to an increase
in the smallest D_VP between different letters,
which means that braille letters became
more distinguishable because of the synaptic 
integration (Fig. 3L). Thus, this approach mimics 
the process of tactile information processing in 
a biological somatosensory system, where the 
signals of multiple tactile inputs from first-order 
neurons are integrated by synapses to partially 
process the information before delivery to the 
brain (20, 27). 
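
For readers unfamiliar with the Victor-Purpura metric, a minimal sketch of the standard spike-time version follows. The text only names the metric (32); how spike times were extracted from the postsynaptic currents and which cost parameter was used are not stated here, so the parameter `q` and the example spike trains below are hypothetical. The distance is the cheapest way to turn one spike train into the other, where inserting or deleting a spike costs 1 and shifting a spike by a time Δt costs q·|Δt|.

```python
def victor_purpura_distance(train_a, train_b, q):
    """Victor-Purpura spike-time distance between two sorted spike trains.

    Deleting or inserting a spike costs 1; shifting a spike by dt costs q * |dt|.
    """
    na, nb = len(train_a), len(train_b)
    # cost[i][j]: distance between the first i spikes of a and the first j of b
    cost = [[0.0] * (nb + 1) for _ in range(na + 1)]
    for i in range(1, na + 1):
        cost[i][0] = float(i)  # delete all i spikes
    for j in range(1, nb + 1):
        cost[0][j] = float(j)  # insert all j spikes
    for i in range(1, na + 1):
        for j in range(1, nb + 1):
            shift = q * abs(train_a[i - 1] - train_b[j - 1])
            cost[i][j] = min(cost[i - 1][j] + 1.0,        # delete a spike from a
                             cost[i][j - 1] + 1.0,        # insert a spike from b
                             cost[i - 1][j - 1] + shift)  # shift a spike in time
    return cost[na][nb]


# Hypothetical spike times in seconds; q is in units of 1/s.
letter_x = [0.10, 0.20, 0.30]
letter_y = [0.10, 0.25, 0.30]

d_same = victor_purpura_distance(letter_x, letter_x, q=10.0)
d_diff = victor_purpura_distance(letter_x, letter_y, q=10.0)
d_empty = victor_purpura_distance(letter_x, [], q=10.0)
print(d_same, d_diff, d_empty)
```

A larger value means the two trains are harder to transform into each other, which matches the statement above that a larger D_VP means more dissimilarity.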

We connected our artificial afferent nerve to 
biological efferent nerves of a discoid cockroach 
(Blaberus discoidalis) (Fig. 4A) to complete a 
hybrid monosynaptic reflex arc (Fig. 4B), emu- 
lating a biological reflex arc (fig. S19). We used 
this hybrid system to demonstrate the flow of 
information from multiple pressure sensors 
through a neuromorphic circuit to deliver bio- 
mimetic postsynaptic oscillating signals into the 
biological efferent nerves in a detached cock- 
roach leg (Fig. 4C), leading to the actuation of 
the tibial extensor muscle in the leg (Fig. 4D). 
The oscillating signals from our artificial afferent 
nerve elicit action potentials in nerves better than 
constant voltages (33). An increase in the ampli- 
tude and frequency of stimulation signals increases 
the number of activated muscle fibers and the 
forces generated by each muscle fiber, respectively 
(33, 34). When we increased the intensity and

oscillator output. (D) Postsynaptic current output of an artificial afferent
nerve for three different pressure intensities. The duration of the stim- 
ulus application was 4 s for all three cases. (E) Response to three 
different durations of the pressure stimulus with a constant pressure 
intensity of 80 kPa. (F) The peak amplitude and frequency of the 
postsynaptic current depending on the duration of the stimulus applica- 
tion for the fixed amplitude of pressure (80 kPa). All error bars in (A) to 
(F) show 1 SD. (i) to (iv) correspond to the signals in Fig. 1. Arrows 
indicate the conversion of the signals by pressure sensors, organic ring 
oscillators, and synaptic transistors.

Fig. 3. Characterization of an artificial afferent nerve system with multiple
branches. (A) Artificial afferent nerve with two branches of ring oscillators
and pressure sensors measured in (B) to (F). (B and C) Postsynaptic
currents when only one pressure sensor was pressed with 20 kPa (B) and 
80 kPa (C), respectively. (D) Postsynaptic current when pressures of 20 and 
80 kPa were simultaneously applied to two pressure sensors. (E) Plot of the 
sum of currents from (B) and (C). Our synaptic transistor functions as an 
adder so that (D) and (E) are almost the same. (F) The amplitude of Fourier 
transform of the cases in (B) to (E). Frequency components corresponding 
to pressures are maintained after the pressure information is combined 

by a synaptic transistor. Each transform was done for 4 s of data and nor- 
malized to its maximum peak. (G) Artificial afferent nerve with a cluster of 
two pressure sensors used for movement recognition in (H) and (I). The width of 
electrodes was 800 µm, and the distance between the two electrodes was
400 µm. (H and I) Postsynaptic currents when an object is moved in the
direction of the red arrow (H) and the direction of the blue arrow (I) in (G).
(J) Portion of the connections used for braille reading in (K) and (L). Ring 
oscillators and synaptic transistors were connected to an array of three pixels 
by two pixels of pressure sensors. A synaptic transistor was connected to 
either one or two ring oscillators. The complete combinations of connections 
used are shown in fig. S17, A and B. (K) (Left) Applied pressures on the pressure 
sensor array. (Right) Peak frequencies of postsynaptic currents from synaptic 
transistors connected to only one pixel (the synaptic transistors in blue boxes in 
fig. S17A). The results from all the braille characters are shown in fig. S18A.

(L) The smallest Victor-Purpura distance (the metric used to quantify the 
difference between postsynaptic currents) between the postsynaptic currents 
of different alphabets. The integration of signals from two pixels by synaptic 
transistors improves the discrimination among the braille characters.

Fig. 4. Hybrid reflex arc. (A) Discoid cockroach with an artificial afferent
nerve on its back. (B) Hybrid reflex arc made of an artificial afferent 
nerve and a biological efferent nerve. This experimental setup was used 
for measurements in (D) to (F). Pressure stimuli from multiple spots can 
be combined by an artificial afferent nerve and can be converted into 
postsynaptic currents. Postsynaptic currents are amplified to stimulate 
biological efferent nerves and muscles to initiate movement. (C) Photograph
of reference and stimulating electrodes, a detached cockroach leg,
and a force gauge used for (D) to (F). The tibial flexor muscle was 


duration of the stimulus application on the ar- 
tificial afferent nerve, the maximum isometric 
contraction force of the tibial extensor muscle 
increased accordingly (Fig. 4, E and F, respec- 
tively; additional data are in fig. S20). Isotonic 
contractions are shown in fig. S21 and movie S1. 

Inspired by state-of-the-art understanding of 
biological afferent nerves, we fabricated an arti- 
ficial afferent nerve based on organic devices 
that have multiple hotspots in the receptive field, 
generate action potentials depending on the 
combined pressure inputs, and integrate action 
potentials at a synaptic transistor. The biomi- 
metic hierarchical structures were used to detect 
the shape and movement of an object in simple 
cases and to distinguish braille characters. Final- 
ly, our artificial afferent nerve was connected to 
biological efferent nerves to demonstrate a hybrid 
bioelectronic reflex arc and control biological 
muscles.

INDUCED SEISMICITY


The November 2017 M_w 5.5 Pohang
earthquake: A possible case of 
induced seismicity in South Korea 


F. Grigoli, S. Cesca, A. P. Rinaldi, A. Manconi, J. A. López-Comino, J. F. Clinton,
R. Westaway, C. Cauzzi, T. Dahm, S. Wiemer


The moment magnitude (M_w) 5.5 earthquake that struck South Korea in November
2017 was one of the largest and most damaging events in that country over the past 
century. Its proximity to an enhanced geothermal system site, where high-pressure 
hydraulic injection had been performed during the previous 2 years, raises the 
possibility that this earthquake was anthropogenic. We have combined seismological 
and geodetic analyses to characterize the mainshock and its largest aftershocks, 
constrain the geometry of this seismic sequence, and shed light on its causal factors. 
According to our analysis, it seems plausible that the occurrence of this earthquake 
was influenced by the aforementioned industrial activities. Finally, we found that
the earthquake transferred static stress to larger nearby faults, potentially increasing
the seismic hazard in the area.


Deep geothermal resources can provide a
valuable contribution to the production
of renewable energy. Through enhanced
geothermal systems (EGSs), geothermal
energy production is no longer confined
to volcanic or hydrothermal regions. Unlike the 
conventional geothermal systems, EGS technol- 
ogies exploit geothermal resources through hy- 
draulic stimulation—which involves the injection 
of high-pressure cold water to increase the per- 
meability of the target formation at a few kilo- 
meters of depth—by creating new fractures or 
enhancing existing ones. Although the potential 
for deep geothermal energy is indisputably large, 
in urban areas the problem of induced seismicity 
associated with such operations is often not ad- 
equately addressed. 

On 15 November 2017, a moment magnitude 
(M_w) 5.5 earthquake struck South Korea, injuring
~70 people and causing extensive damage in and
around the city of Pohang. This earthquake was
preceded by the M_w 5.5 Gyeongju event of 12
September 2016, which occurred ~30 km farther 
south on a major right-lateral fault, the Yangsan 
fault, which continues northward through the 
Pohang area (1, 2) (Fig. 1, A and B). These earthquakes
are the largest recorded in South Korea
since instrumental monitoring of seismicity be- 
gan in 1903 (2). The proximity of the 2017 Pohang 
earthquake to an EGS site (Fig. 1B), where hy- 
draulic stimulation operations had recently taken 
place, has led to a public debate in South Korea 
regarding the potential anthropogenic origin of
this event. At this EGS site between early 2016 and
September 2017, many thousands of cubic meters 
of water were injected under pressure into wells 
reaching ~4 km of depth (3). Although an in- 
vestigation by the South Korean government is 
currently ongoing, here we present observations 
that suggest a causal connection between the 
EGS activity and the most recent large earthquake. 

The Korean Peninsula is generally considered 
stable with low to moderate intraplate seismic 
activity, but the historical seismicity of this re- 
gion indicates large long-term variations in earth- 
quake rate and energy release (4, 5). The relatively 
low rate of activity since 1904 was preceded by 
much higher activity between the 15th and 18th
centuries, with a peak of 1000 reported histor- 
ical earthquakes between 1500 and 1600 CE (5). 
The largest events reached magnitude ~7 (4, 5). 
The historic earthquakes were likely associated 
with the major fault systems of the area, such as 
the Yangsan fault, and highlight that these structures
are active (4, 5). In principle, given the
prevalence of faults in this region, the increase in
seismic activity represented by the 2016 Gyeongju
and 2017 Pohang earthquakes is not completely
inconsistent with the historically varying rates
of seismicity. This line of reasoning preserves the
possibility that the occurrence of earthquakes 
close to the EGS site is a coincidence. 

We applied full-waveform seismological methods 
to regional and teleseismic data (Fig. 1A) (6), as 
we do not have access to open data from a local 
seismic network (with the exception of two ac- 
celerometers deployed in the epicentral area). 
We analyzed 15 days of continuous waveform 
data spanning from 15 to 30 November. We 
detected and relocated 46 events, most with 
magnitude M > 2. The trend of these 46 epicenters
indicates a west-southwest (WSW)-to-east-northeast
(ENE) strike of the fault that
ruptured in the mainshock (Fig. 2A). We determined
3- to 7-km hypocentral depths for most of
these events (Fig. 2, A and B). These depths are 
shallower than the depths of typical seismic events 
in the area (~12 to 15 km) (Fig. 2C) (2, 4). For the 
Pohang earthquake, we determined the depth of 
both the mainshock and the largest aftershock 
to be 4.0 to 4.5 km (Fig. 2B). Depth is a critical 
parameter for discrimination between natural 
and induced seismicity (7), so we obtained inde- 
pendent estimates (by using array analysis at 
teleseismic distances and two accelerometers 
located in the epicentral region) that confirm 
the shallow depth (6). Our moment tensor 
inversion indicated that the mainshock had a 
reverse-to-oblique double-couple (DC) mecha- 
nism, with a WSW-to-ENE striking nodal plane, 
subparallel to the aftershock zone and dipping 
north-northwest at ~66°. The full moment tensor
had a large non-DC term. Conversely, the M_w 4.3
aftershock indicated reverse faulting on a WSW- 
to-ENE striking fault (Figs. 1B and 2A). On the 
basis of many previous analyses [e.g., (8, 9)], we 
inferred that the non-DC component (Fig. 2A) is 
caused by a complex rupture process that includes 
the (near-)simultaneous activation of differently 
oriented faults. By mapping the azimuthal dis- 
tribution of the apparent source durations, maxi- 
mum energy peaks, and centroid time delays, we 
detected a common pattern, which we interpret as 
the failure of two subevents at close origin times 
and separated by a short distance along the azi- 
muth of rupture directivity. We estimated a dis- 
tance of 3.5 to 4.0 km between the two subevents, 
indicative of a dynamic triggering process (6). 
We thus hypothesize that the earthquake involved 
the failure of two different faults with slightly dif- 
ferent orientations, which might, in principle, ex- 
plain the non-DC term of the moment tensor (9), 
as well as the complexity of P-wave signals for 
the mainshock (6). A potential alternative source 
model, characterized by a complex rupture along 
a single fault, with heterogeneous slip directions, 
cannot explain the non-DC component of the 
moment tensor and results in a substantially 
worse fit for the pattern of the relative hypo- 
central centroid location (6). 

We also used satellite radar interferometry 
[differential interferometric synthetic aperture 
radar (DInSAR)] to map and measure the co- 
seismic surface deformation associated with the 
mainshock and to independently model its source 
geometry (6, 10). Our InSAR analysis indicated 
a maximum surface deformation of ~5 cm, with a 
fault location and area consistent with the seis- 
mological analysis (Fig. 3A). Despite the moderate 
event size and complex rupture resolved by seis- 
mological data, we can fit the observed deforma- 
tion with a simple elastic dislocation model (17) 
based on a single fault plane with shear displace- 
ment and opening (Fig. 3B). The fault dimensions 
(length ~5 km and width ~1.6 km) and the apparent
slip, derived from this inversion of geodetic
data, are compatible with a magnitude of
M_w 5.5 (12) and consistent with our seismological
analysis. However, DInSAR data cannot resolve 
the small-scale complexities associated with the
progressive rupture of adjacent fault patches.
Nonetheless, our geodetic analysis confirmed 
that the earthquake nucleation and main slip 
on the fault occurred at very shallow depth (4 to 
5 km), on a reverse fault striking WSW-to-ENE 
and with a ~75° dip toward the NW (Fig. 2B). The 
average residuals are ~0.05 cm and within the 
measurement accuracy (Fig. 3C). The DInSAR 
results are in good agreement with the after- 
shock locations and the focal mechanisms of the 
largest events, placing strong independent con- 
straints on the location and extent of a previously 
unmapped fault system. 
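
The statement that the geodetic fault dimensions are compatible with a magnitude of about 5.5 can be checked with a back-of-the-envelope calculation. The sketch below uses the length (~5 km) and width (~1.6 km) given in the text and the 1-m slip listed in the Fig. 3 caption; the shear modulus of 3 × 10^10 Pa is an assumed typical crustal value that is not stated in the article, and the magnitude formula is the standard moment-magnitude definition rather than anything specific to this study.

```python
import math


def seismic_moment(shear_modulus_pa, length_m, width_m, slip_m):
    """Scalar seismic moment M0 = mu * A * D, in N·m."""
    return shear_modulus_pa * length_m * width_m * slip_m


def moment_magnitude(m0_newton_metres):
    """Moment magnitude from M0 in N·m: Mw = (2/3) * (log10(M0) - 9.1)."""
    return (2.0 / 3.0) * (math.log10(m0_newton_metres) - 9.1)


MU = 3.0e10  # Pa; assumed typical crustal shear modulus, not given in the text

m0 = seismic_moment(MU, length_m=5.0e3, width_m=1.6e3, slip_m=1.0)
mw = moment_magnitude(m0)
print(f"M0 = {m0:.2e} N·m, Mw = {mw:.2f}")
```

With these inputs the moment comes out near 2 × 10^17 N·m, the same order as the seismological estimate, and the magnitude lands close to 5.5. A different assumed shear modulus would shift the result slightly.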

Natural and induced earthquakes are not dis- 
tinguishable by their waveform characteristics. 
It is generally necessary to build a convincing 
chain of evidence to differentiate induced earth- 
quakes from naturally occurring events (7, 13-16). 
For instance, the hypothesis that the Pohang 
earthquake sequence is anthropogenic is sup- 
ported by the spatial correlation between the 
mainshock and its aftershocks and the injections.
Our seismological and geodetic analyses
indicate that the activated fault passes directly 
beneath the EGS site (Fig. 2B), within ~1 km of 
the termination of the injection wells (Fig. 2B). 
The combined evidence from the hypocentral lo- 
cations, the DInSAR data inversion, the observed 
interval between P- and S-wave arrival times at 
the borehole and surface stations, and the ob- 
served strong motion and damage patterns is 
consistent with this interpretation (6). Another 
piece of evidence supporting that this seismic- 
ity is induced is our relocation of the local mag- 
nitude 3.1 earthquake on 15 April 2017, which 
occurred during hydraulic stimulation opera- 
tions at the EGS site: This event was very close 
to the 15 November mainshock (Fig. 2A). No 
stimulation activities occurred in the 2 months 
preceding this mainshock, but induced earth- 
quakes can be delayed by days, weeks, or even 
months after the start or end of injection (14). 
Our techniques (using regional and teleseismic 
data from the public domain, thus restricted to
earthquakes of magnitude >~2) demonstrate
the extent to which a candidate case study of 
induced seismicity can be investigated using 
public data, without proprietary data from site 
operators. 

We also studied the relation between the 
seismic moment (M_0 ≈ 1.7 × 10^17 N·m) of the
mainshock (6) and the volume (V) of fluid injected.
Assuming that unreported injections are
similar to initial stimulation (3), we estimated an
upper bound to the total volume of ~10,000 m³.
The limitation of water supply and storage capacity
of the EGS justifies our assumption. We
approximated an upper bound to M_0 by using
the product of net injected volume and the shear
modulus (17). For V = 10,000 m³, this relation
implies an upper bound of M_0 ≈ 2 × 10^14 N·m,
or M_w ≈ 3.5. On this basis, an induced earthquake
of M_w 5.5 would require three orders of
magnitude more volume (V ≈ 10^7 m³). However,
several counterexamples to this scaling relation 
exist [(18, 19) and references therein], and the 


Fig. 1. The 2016 and 2017
M_w 5.5 earthquakes in South
Korea. (A) Regional map
showing locations of the Gyeongju
and Pohang earthquakes, the
Yangsan fault, and the available
open seismic stations. (B) Map
of the study area showing the
main faults of the area, the 
distribution of seismicity with 
respect to the EGS site, and the 
mechanisms of the largest events. 
A more detailed map of the area 
of study (outlined by the yellow 
square) is shown in Fig. 2A. UTC, 
universal time coordinated.

Fig. 2. Spatial distribution of the 2017 Pohang seismic sequence.
(A) Detailed map showing the epicentral distribution of seismicity and
(B) two cross sections displaying the depth distribution of seismicity
(including location uncertainties) and the fault as inferred by geodetic
analysis. The EGS site is located at 36°06'23.34"N, 129°22'46.08"E and
includes the two injection wells that reach depths of 4127 and 4348 m 
(3). (C) Focal depth distribution of earthquakes in the study region and 
comparison with the 2016 Gyeongju seismic sequence.

Fig. 3. DInSAR data and model. (A) Surface deformation (satellite
line-of-sight displacements) obtained with InSAR. Seismicity and the 
extrapolated fault trace are indicated by black circles and a dashed line, 
respectively. (B) Modeled surface deformation using a rectangular fault plane 
with the following parameters: latitude = 36.100° ± 0.005°, longitude =
129.383° ± 0.003° (center of the rectangular fault), depth = 4.3 ± 0.3 km
(upper edge of the fault), strike (from north) = 225° ± 12°, dip (from horizon) =
75° ± 11°, length = 5.0 ± 0.7 km, width = 1.6 ± 0.4 km, slip = 1 ± 0.22 m, and
rake = 123° ± 35°. (C) Difference between InSAR data and model. The standard
deviation is <0.5 cm, which is below the accuracy threshold of the measurements.

Fig. 4. Coulomb stress modeling. (A) Static Coulomb failure stress showing the effect of the
2016 Gyeongju earthquake at 4 km of depth on the fault activated by the 2017 Pohang event and 
(B) the effect of this latter event on the Yangsan fault at 14 km of depth. Receiver faults are denoted 
by red dot-dashed lines. The triangle represents the local accelerometer, and the squares denote
the injection wells.


Pohang earthquake provides much clearer evi- 
dence that the use of this relation should be 
subject to caution. 
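
The volume scaling argument in this paragraph can be reproduced with a few lines of arithmetic. In the sketch below, the shear modulus G = 2 × 10^10 Pa is an assumption, chosen because it reproduces the quoted upper bound when multiplied by 10,000 m³; the article does not state the value used. The magnitude conversion is the standard moment-magnitude definition.

```python
import math


def max_moment_from_volume(shear_modulus_pa, volume_m3):
    """Upper bound on seismic moment from net injected volume: M0 = G * V (N·m)."""
    return shear_modulus_pa * volume_m3


def moment_magnitude(m0_newton_metres):
    """Moment magnitude from M0 in N·m: Mw = (2/3) * (log10(M0) - 9.1)."""
    return (2.0 / 3.0) * (math.log10(m0_newton_metres) - 9.1)


G = 2.0e10            # Pa; assumed shear modulus (not stated in the text)
V_INJECTED = 1.0e4    # m³; upper bound on injected volume from the text
M0_MAINSHOCK = 1.7e17 # N·m; mainshock seismic moment from the text

m0_bound = max_moment_from_volume(G, V_INJECTED)
mw_bound = moment_magnitude(m0_bound)

# Volume that the same relation would require to reach the observed moment.
v_required = M0_MAINSHOCK / G

print(f"bound: M0 = {m0_bound:.1e} N·m, Mw = {mw_bound:.2f}")
print(f"volume needed for the mainshock: {v_required:.1e} m³ "
      f"({v_required / V_INJECTED:.0f} times the injected volume)")
```

Under these assumptions the bound corresponds to a magnitude of about 3.5, and reaching the observed moment would take a volume close to 10^7 m³, roughly three orders of magnitude more than was injected. This is the mismatch that leads the authors to urge caution with the relation.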

Although none of our observations exclude 
the possibility that the Pohang earthquake was 
induced by industrial activity at the EGS site, our 
seismological and geodetic analyses (Figs. 2 
and 3) rule out a reactivation of the Yangsan 
fault. Our hypocenter is <1 km southeast of the 
injection point, at the same depth as the in- 
jection. The coseismic deformation and source 
model derived from the DInSAR analysis con- 
firm this location (Figs. 2 and 3). Aftershock 
locations and focal mechanisms consistently in- 
dicate activation of a previously unmapped fault 
system. This case thus highlights the importance 
of a preliminary seismotectonic assessment of 
the area surrounding any future EGS project site, 
aimed at identifying potentially active faults. 
Along with real-time analysis and control sys- 
tems (20), this evaluation may mitigate the risk 
associated with induced seismicity. 

We also investigated whether the September 
2016 M_w 5.5 Gyeongju earthquake might have
contributed to triggering the 2017 Pohang event.
Coulomb stress modeling (6) indicates that the 
Gyeongju earthquake caused a slight (~0.0005 MPa) 
increase in static stress on the fault that ruptured 
in the subsequent Pohang earthquake. This value
is small, but it could plausibly have advanced the
timing of rupture (a clock advance) on that fault. In turn,
the M_w 5.5 Pohang earthquake transferred static
stress of 0.015 MPa onto the northern part of the 
Yangsan fault (Fig. 4B), potentially increasing the 
seismic hazard in this area. 
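
For reference, the static Coulomb failure stress change on a receiver fault is commonly written as shown below. This is the standard textbook form and not necessarily the exact formulation used in (6). The symbols are notation introduced here: Δτ is the shear stress change resolved in the slip direction of the receiver fault, Δσ_n is the normal stress change (taken as positive when the fault is unclamped), and μ′ is an effective friction coefficient.

```latex
\Delta \mathrm{CFS} = \Delta\tau + \mu' \, \Delta\sigma_n
```

A positive ΔCFS moves the receiver fault closer to failure, which is the sense in which the stress changes quoted above are interpreted.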

In accordance with our findings, it is plausible 
that the occurrence of the 2017 Pohang earth- 
quake was influenced by the nearby stimula- 
tion activities. If so, the Pohang event was the 
largest and most damaging earthquake ever to 
have been associated with an EGS, making it a 
potential “game changer” for the geothermal 
industry worldwide.

INDUCED SEISMICITY


Assessing whether the 2017 M_w 5.4
Pohang earthquake in South Korea 
was an induced event 


Kwang-Hee Kim, Jin-Han Ree, YoungHee Kim, Sungshil Kim,
Su Young Kang, Wooseok Seo


The moment magnitude (M_w) 5.4 Pohang earthquake, the most damaging event in South Korea
since instrumental seismic observation began in 1905, occurred beneath the Pohang 
geothermal power plant in 2017. Geological and geophysical data suggest that the Pohang 
earthquake was induced by fluid from an enhanced geothermal system (EGS) site, which was 
injected directly into a near-critically stressed subsurface fault zone. The magnitude of the 
mainshock makes it the largest known induced earthquake at an EGS site. 


The injection of fluid into reservoir rocks,
which facilitates oil and gas recovery, enhances
geothermal systems, and aids in the
disposal of wastewater and CO2 gas, also
has a small chance of inducing earthquakes
[e.g., (1-4)]. Empirical and theoretical relationships
exist to connect the maximum magnitude of an 
induced earthquake and the injected fluid vol- 
ume (2, 5). The magnitude of an induced earth- 
quake may be tectonically controlled—for example, 
owing to the presence of a fault suitably oriented 
for slip under a given stress field that is also loc- 
ated adjacent to one or more injection or pro- 
duction wells (6, 7). The earthquake nucleation 
itself might be controlled by the injection (6). 
Previously observed magnitudes of induced seis- 
micity at enhanced geothermal system (EGS) 
sites have been relatively small (8); the largest 
local magnitude (M_L) reported was 3.4, in Basel,
Switzerland (9), although a much larger, possibly 
induced earthquake has been reported in the 
Cerro Prieto geothermal field, Mexico, a tectonically
active area (10).

A moment magnitude (M_w) 5.4 earthquake
occurred at the Pohang EGS site in southeastern 
Korea on 15 November 2017. The earthquake 
was the most damaging and the second-largest 
in magnitude in South Korea since the first 
seismograph was installed in 1905. The earth- 
quake injured 90 people, and the estimated prop- 
erty damage was US$52 million (11). We present 
evidence that the Pohang earthquake was the 
largest induced event to have occurred at any 
EGS site worldwide. Moreover, this event in- 
dicates that injected fluid volumes much smaller 
than predicted by theory can trigger a relatively 
large earthquake, at least under the right set of 
conditions.

The Korean Peninsula lies within the Eurasian
Plate (Fig. 1), although it was composed of con- 
tinental magmatic arcs at a plate boundary until 
the early Tertiary period (~30 million years ago) 
(12). North-northeast (NNE)-striking strike-slip 
faults and NNE- to NE-striking normal faults 
developed predominantly in southeastern Korea 
and adjacent offshore areas when the East Sea 
(or Japan Sea) opened as a back-arc basin in the 
early to middle Tertiary (~30 to 15 million years 
ago), with the coetaneous formation of

Fig. 1. Tectonic map of northeast Asia.
Saw-toothed lines with solid teeth denote
subduction zones. The broken line with open
teeth represents an incipient subduction zone
(30). EU, Eurasian Plate; NA, North American
Plate; PS, Philippine Sea Plate; PA, Pacific Plate.

smaller-scale basins, including the Pohang (12-14). Some
of these faults have been reactivated as strike-slip 
and thrust faults in the current compressional 
regime (13-15). The axes of compression deter- 
mined by focal mechanism solutions indicate 
shallow plunges to the ENE throughout the 
southern Korean Peninsula (15, 16). 

The Pohang basin consists of nonmarine to 
deep marine sedimentary strata dating to the 
Miocene (~20 million years ago), with a base- 
ment composed of Cretaceous to Eocene sedi- 
mentary and volcanic rocks and late Paleozoic to 
Eocene granitoids (17-19) (Fig. 2A). The geology 
of the Pohang EGS site comprises (from top 
to bottom) Quaternary alluvia (<10 m thick), 
Miocene semi-consolidated mudstone (200 to 
400 m thick), Cretaceous to Eocene sedimentary 
and igneous rocks (~1000 m thick), and Permian 
granodiorite with gabbroic dykes (19-21) (Fig. 2B). 

One vertical injection well (4346 m deep; PX2) 
and another deviated production well (4362 m 
deep; PX1) were drilled into Permian grano- 
diorite with gabbroic dykes for the EGS, with an 
expected electricity production of 1.2 MW (21). 
The PX1 well, only 6 m from PX2 at the surface, 
is 600 m northwest of PX2 at the bottom (Fig. 3). 
The drilling began in September 2012 and was 
completed in November 2015. No earthquakes 
with ML > 2.0 were recorded within 10 km of the 
Pohang EGS site between 1978 and 2015 (22); a 
total of six earthquakes with Mw 1.2 to 1.9 were 
detected in the area between 2006 and 2015. To 
further examine the seismicity around the EGS 
site, we improved the earthquake catalog by 
applying a matched filter to continuous wave- 
forms (23) recorded by a permanent seismic 
station (PHA2; Fig. 2A), located about 10 km 
north of the EGS site, during the period between 
1 January 2012 and 14 November 2017. Once 
detected, waveforms were visually inspected for 
time differences between P- and S-wave arrivals 
consistent with a source at Pohang (~1.54 s). 
The matched-filter analysis found no noticeable 
earthquakes at the EGS site before the comple- 
tion of drilling. We detected a total of 148 earth- 
quakes by matched filtering, all of which occurred 
after the completion of the drilling, including 
four earthquakes with Mw > 2.0. 
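
The matched-filter idea used above can be illustrated with a minimal sketch. This is not the processing code behind the catalog (that procedure is described in (23)); it only shows the general technique of sliding a template waveform along a continuous record, computing the normalized cross-correlation, and keeping peaks above a threshold. The function names, the synthetic record, and the threshold of eight times the median absolute deviation (MAD) are assumptions made for illustration.

```python
import numpy as np


def normalized_cross_correlation(data, template):
    """Correlation coefficient between the template and every window of data."""
    n = len(template)
    # Zero-mean, unit-variance template, pre-divided by n so that the dot
    # product below is already an average over the window.
    t = (template - template.mean()) / (template.std() * n)
    windows = np.lib.stride_tricks.sliding_window_view(data, n)
    w_std = windows.std(axis=1)
    w_std[w_std == 0] = np.inf  # flat windows get a correlation of zero
    # Because t sums to zero, the window mean drops out of the numerator.
    return (windows @ t) / w_std


def detect_events(data, template, n_mad=8.0):
    """Return sample indices of correlation peaks above n_mad * MAD."""
    cc = normalized_cross_correlation(data, template)
    mad = np.median(np.abs(cc - np.median(cc)))
    above = np.flatnonzero(cc > n_mad * mad)
    if above.size == 0:
        return []
    # Merge neighbouring samples into one detection and keep its peak.
    breaks = np.flatnonzero(np.diff(above) > len(template)) + 1
    return [int(group[np.argmax(cc[group])]) for group in np.split(above, breaks)]


def synthetic_record(offsets, n_samples=20000, amplitude=10.0, seed=0):
    """Hypothetical test data: white noise plus copies of a decaying sinusoid."""
    rng = np.random.default_rng(seed)
    k = np.arange(200)
    template = np.sin(2 * np.pi * 0.05 * k) * np.exp(-k / 50.0)
    data = rng.standard_normal(n_samples)
    for offset in offsets:
        data[offset:offset + len(template)] += amplitude * template
    return data, template


if __name__ == "__main__":
    data, template = synthetic_record([3000, 12000])
    print(detect_events(data, template))
```

In practice the template would be the waveform of a known event at the station, and each detection would still need the kind of visual check on P- and S-wave arrival times described in the text.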

Hydraulic stimulation began on 29 January 
2016 and comprised four phases of injection 
with a total volume of 12,800 m³ at injection 
rates of 1.00 to 46.83 liter s⁻¹ (Fig. 4). Fluid was 
injected into both PX1 (phases 2 and 4) and PX2 
(phases 1, 3, and 4). To investigate the relation- 
ship between seismicity and fluid injection, we 
used data on regional earthquakes detected by 
the matched filter together with earthquake data 
provided by the Ministry of Trade, Industry, and 
Energy (MTIE), Republic of Korea (23). We did 
not detect noticeable microearthquakes before 
the drilling. The timing of the earthquakes coin- 
cides with that of fluid injections. The first re- 
ported hydraulic stimulation (phase 1) was carried 
out between 29 January 2016 and 20 February 
2016, followed by three additional phases of 
fluid injection (Fig. 4). Each injection phase 
was accompanied by intense seismic activity 
that started only a few days after injection. 
Microseismic activity decreased rapidly after the 
termination of fluid injection. The magnitudes 
of induced earthquakes tend to increase with 
the net volume of injected fluid. After an Mw 3.1 
earthquake on 15 April 2017, which was the 
largest felt event near the EGS site before the 
Mw 5.4 Pohang earthquake, we deployed eight 
temporary seismic stations around the EGS site. 
Each standalone station was equipped with a 
three-component velocity-type short-period 
sensor. These sensors record continuous seismic 
data at a sampling frequency of 200 Hz. Installa- 
tion was completed on 10 November 2017. All of 
them are still in active operation. 

Fig. 2. Geologic map and column of 
the Pohang basin. (A) Map showing 
rocks and faults of the Pohang basin 
and adjacent area. One permanent 
seismic station operated by the Korea 
Meteorological Administration (PHA2) 
and our eight temporary seismic stations 
are represented by a dark blue square 
and green triangles, respectively. The 
green square denotes the site of the 
Pohang enhanced geothermal system 
(EGS). The geologic map was compiled 
from (18, 19). (B) Geologic column of the 
Pohang EGS site with injection well. The 
geologic column was compiled from 
(19, 20). 

Fig. 3. Spatial distribution of epicenters and hypocenters of the 
2017 Pohang earthquake sequence. (A) Epicenters of six foreshocks 
(red circles), mainshock (red star), and 210 aftershocks (black open 
circles) recorded in the first 3 hours after the mainshock. The location of 
the Pohang EGS is indicated by a green square. Blue triangles represent 
our eight temporary seismic stations. The red beach ball represents 
the source mechanism of the mainshock. Black beach balls show 
the focal mechanism solutions of representative aftershocks. Numbers 
above beach balls are local magnitudes. The black beach balls of 
the Mw 2.4 and 3.5 aftershocks, recorded 1 and 3 days after the 
mainshock, respectively, show strike-slip faulting. X-X' and Y-Y' denote 
the locations of the cross sections shown in (B) and (C), respectively. 
(B and C) Hypocentral distributions of earthquakes, projected onto 
vertical planes along the lines X-X' (B) and Y-Y' (C) shown in 
(A). The red beach ball in (B) represents the focal mechanism of 
the mainshock projected onto a vertical cross section. PX1 and PX2 
denote production and injection wells, respectively. Other symbols are 

Fig. 4. History of fluid injection volume. Volume of injected and bleed-off fluid (left axis) and net 
fluid volume (right axis) as a function of time. Red circles denote times of seismic events from data 
provided by the MTIE (63 events; magnitude information is only available for M > 1). Dark blue circles 
denote seismic events determined by matched-filter analysis (148 events). The right axis also gives 
the local magnitudes of seismic events. 

The Mw 5.4 mainshock occurred at the Pohang 
EGS site on 15 November 2017 and was preceded 
and followed by foreshocks and aftershocks, re- 
spectively, all of which were well recorded by our 
local seismic array at distances of 0.6 to 2.5 km 
from the mainshock epicenter (Figs. 2A and 3). 
We precisely relocated this earthquake sequence 
with the hypoDD software package (23, 24) (fig. 
S2). We plotted the spatial distribution of six 
foreshocks (Mw < 2.6), the mainshock, and 210 
aftershocks that occurred within 3 hours of the 
mainshock (Fig. 3). The first two foreshocks 
occurred about 9 hours before the mainshock; 
the remaining four preceded it by 6 to 7 min. 
Most hypocentral depths fell in the range of 4 
to 6 km, and the mainshock depth was about 
4.5 km. Of note, the hypocenters of the fore- 
shocks and mainshock were located immediately 
adjacent to the bottom of PX1. The hypocentral 
depths of the Pohang earthquake sequence are 
shallower than those of most earthquakes in the 
Korean Peninsula, which tend to occur at depths 
of 10 to 20 km (25). For comparison, the largest 
instrumentally recorded earthquake in South 
Korea, the 2016 Gyeongju event (ML 5.8), had a 
hypocentral depth of about 14 km (26). The 
spatiotemporal distribution of hypocenters in- 
dicates that the rupture plane consisted of two 
segments, a main southwestern segment and a 
subsidiary northeastern segment (Fig. 3 and 
fig. S3). The aftershocks tended to occur earlier 
on the main segment than on the subsidiary 
segment (fig. S4). This observation, together 
with the locations of the foreshocks and main- 
shock on the main segment, suggests that the 
main segment ruptured earlier than the sub- 
sidiary one. By statistical plane-fitting using 
MATLAB software (Mathworks), we determined 
the best-fit orientations of the main and sub- 
sidiary rupture planes to be N36°E (strike)/ 
65°NW (dip) and N19°E/60°NW, respectively 
(fig. S3). These orientations are consistent with 
the nodal planes determined by focal mechanism 
solutions (Fig. 3). The main segment shows thrust 
faulting with a minor strike-slip component, 
whereas the subsidiary segment is dominated 
by strike-slip faulting with a minor dip-slip com- 
ponent. The compression axis trends E-W or 
ENE-WSW with a shallow plunge, similar to that 
of other earthquakes in South Korea (15, 16). The 
locations of the foreshocks and mainshock, at the 
bottom of the injection well, suggest that fluid 
was injected directly into the fault zone. 
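
The plane-fitting step can be sketched as follows. The text states only that a statistical plane fit was done in MATLAB; the version below swaps in a total-least-squares fit by singular value decomposition in NumPy, which is one common choice and not necessarily the method used. The coordinate convention (east, north, up in kilometers), the function names, and the synthetic hypocenters are assumptions for illustration.

```python
import numpy as np


def fit_plane(points_enu):
    """Least-squares plane through points given as (east, north, up) rows.

    Returns strike (right-hand rule), dip, and dip azimuth, all in degrees.
    """
    pts = np.asarray(points_enu, dtype=float)
    centered = pts - pts.mean(axis=0)
    # The right singular vector with the smallest singular value is the
    # direction of least scatter, i.e. the plane normal.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    normal = vt[-1]
    if normal[2] < 0:  # make the normal point upward
        normal = -normal
    dip = np.degrees(np.arccos(np.clip(normal[2], -1.0, 1.0)))
    # The horizontal part of the upward normal points in the down-dip direction.
    dip_azimuth = np.degrees(np.arctan2(normal[0], normal[1])) % 360.0
    strike = (dip_azimuth - 90.0) % 360.0
    return strike, dip, dip_azimuth


def synthetic_hypocenters(strike_deg, dip_deg, n=200, noise_km=0.05, seed=1):
    """Hypothetical hypocenters scattered about a plane (right-hand rule strike)."""
    rng = np.random.default_rng(seed)
    s, d = np.radians(strike_deg), np.radians(dip_deg)
    along_strike = np.array([np.sin(s), np.cos(s), 0.0])
    dip_az = s + np.pi / 2
    down_dip = np.array(
        [np.sin(dip_az) * np.cos(d), np.cos(dip_az) * np.cos(d), -np.sin(d)]
    )
    a = rng.uniform(-2, 2, n)[:, None]
    b = rng.uniform(-1, 1, n)[:, None]
    return a * along_strike + b * down_dip + rng.normal(0, noise_km, (n, 3))


if __name__ == "__main__":
    # A plane striking N36E and dipping 65 degrees to the NW has a
    # right-hand-rule strike of 216 degrees and a dip azimuth of 306 degrees.
    strike, dip, dip_azimuth = fit_plane(synthetic_hypocenters(216.0, 65.0))
    print(f"strike {strike:.0f}, dip {dip:.0f}, dip azimuth {dip_azimuth:.0f}")
```

With real data, the rows would be the relocated hypocenters of one rupture segment, and the quadrant notation used in the text (for example N36°E/65°NW) follows from the strike taken modulo 180° together with the dip azimuth.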

The temporal relationship between seismicity 
and fluid injection, the spatial relationship be- 
tween the hypocenters and the EGS site, and the 
lack of seismicity in the area before the EGS was 
established all suggest that the Pohang earth- 
quake was induced. Furthermore, the immediate 
response of seismicity to fluid injection and the 
locations of the foreshocks and mainshock at the 
bottom of the injection well suggest that fluid 
was injected directly into a fault zone. The fault 
plane inferred from the spatial distribution of 
hypocenters and focal mechanism solutions 
strikes NE and dips to the NW (fig. S4A), similar 
to Quaternary thrust faults in southeastern Korea. 
In fact, a magnetotelluric survey of the EGS site 
detected a low-resistivity feature that could be the 
fault zone, striking NE and dipping to the NW 
(23, 27) (fig. S5). Reverse slip along a subsurface 
fault is consistent with the current stress field. 
All of these lines of evidence indicate that the 
Pohang earthquake was “almost certainly induced” 
in Frohlich et al.’s system of assessment (28). If 
we use McGarr’s (2) equation for the relation- 
ship between the maximum magnitude and the 
total volume of injected fluid, about 4.7 × 10⁶ m³ 
of injected fluid would be required to induce 
an Mw 5.4 earthquake, which is more than 810 
times the fluid volume injected at the Pohang 
EGS site. The permeability structure of fault zones 
is highly heterogeneous, and patches or layers 
of clay-rich gouge within the fault core act as 
barriers to fluid flow (29). The pore pressure thus 
can locally reach a critical value for earthquake 
nucleation after a relatively small volume of fluid 
is injected, depending on fault zone structure. 
Our results imply that if fluid is injected directly 
into a near-critically stressed fault, it can induce 
a larger earthquake than current theory pre- 
dicts. Detailed investigation of the geological, 
geochemical, and geophysical properties of the 
Pohang EGS site will improve our understand- 
ing of earthquake-inducing processes. 
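
The volume estimate above can be checked with a short calculation. The text cites McGarr's equation (2) without reproducing it; the sketch below assumes the commonly quoted form, in which the maximum seismic moment equals the shear modulus times the injected volume, with an assumed shear modulus of 3 × 10¹⁰ Pa, and converts magnitude to moment with the Hanks-Kanamori relation. Both the form of the bound and the constants are assumptions, not values given in the text.

```python
import math

SHEAR_MODULUS_PA = 3.0e10  # assumed crustal shear modulus, not given in the text


def seismic_moment_from_mw(mw):
    """Seismic moment in N*m, inverting Mw = (2/3) * log10(M0) - 6.033."""
    return 10.0 ** (1.5 * (mw + 6.033))


def required_volume_m3(mw, shear_modulus_pa=SHEAR_MODULUS_PA):
    """Injected volume at which the assumed bound M0_max = G * dV reaches mw."""
    return seismic_moment_from_mw(mw) / shear_modulus_pa


if __name__ == "__main__":
    volume = required_volume_m3(5.4)
    print(f"volume needed for Mw 5.4: {volume:.2e} m^3")
    print(f"log10 of the moment: {math.log10(seismic_moment_from_mw(5.4)):.2f}")
```

Under these assumptions the required volume comes out close to the 4.7 × 10⁶ m³ quoted above. Dividing that figure by the stated factor of 810 gives a reference volume of roughly 5,800 m³, which is smaller than the 12,800 m³ total and may correspond to the net volume plotted in Fig. 4, although the text does not say so.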

A radical approach to the copper 
oxidative addition problem: 
Trifluoromethylation of bromoarenes 


Chip Le,* Tiffany Q. Chen,* Tao Liang,* Patricia Zhang, David W. C. MacMillan† 


Transition metal-catalyzed arene functionalization has been widely used for molecular 
synthesis over the past century. In this arena, copper catalysis has long been considered a 
privileged platform due to the propensity of high-valent copper to undergo reductive 
elimination with a wide variety of coupling fragments. However, the sluggish nature of 
oxidative addition has limited copper’s capacity to broadly facilitate haloarene coupling 
protocols. Here, we demonstrate that this copper oxidative addition problem can be 
overcome with an aryl radical-capture mechanism, wherein the aryl radical is generated 
through a silyl radical halogen abstraction. This strategy was applied to a general 
trifluoromethylation of aryl bromides through dual copper-photoredox catalysis. 
Mechanistic studies support the formation of an open-shell aryl species. 


The development of fragment coupling trans- 
formations by transition metal catalysis has 
substantially expanded access to valuable, 
complex organic molecules (1). Major efforts 
in this field have been devoted toward the 
use of aryl halides, stable and highly accessible 
building blocks, as coupling partners for the 
construction of functionalized arenes (2). The 
success of these transformations generally re- 
lies on the transition metal catalyst to activate 
haloarenes through an oxidative addition step 
and, thereafter, forge the desired bond by a re- 
ductive elimination step. This paradigm is exem- 
plified in modern palladium and nickel catalysis, 
wherein catalyst design over four decades has 
rendered these elementary steps generically ef- 
ficient, thereby enabling the development of a 
vast array of coupling manifolds that convert 
(hetero)aryl C-X bonds to C(sp2)-carbon, -nitrogen, 
-sulfur, and -oxygen bonds (3, 4). In contrast, 
copper catalysis has been less successful than 
Pd or Ni with respect to diverse applications in 
cross-coupling chemistry, despite its potential 
for economical and operational benefits. Indeed, 
the most robust copper cross-coupling reactions 
are limited to transmetalling reagents such as 
organoboronic acids, specifically to bypass the 
challenge of oxidative addition (5). 

The limited capacity of copper to participate 
in haloarene functionalizations has been attrib- 
uted to its relatively low rate of oxidative ad- 
dition in comparison to nickel and palladium. 
Formation of the key Cu(III)-arene intermediate 
has been calculated to possess a high kinetic bar- 
rier and is typically the rate-determining step in 
such catalytic cycles (6-8). Although elegant 
studies in ligand design have allowed for im- 
provements in the rates of oxidative addition 
for certain substrate classes, functionalization 
of aryl halides by copper catalysis remains largely 
restricted to aryl iodides and activated aryl bro- 
mides (9-11). The need to overcome this oxida- 
tive addition problem is further underscored 
by the knowledge that the subsequent reduc- 
tive elimination step from high-valent Cu(III) 
complexes is extremely facile for a wide range 
of coupling partners (12). In many cases, the 
reductive elimination of highly electronegative 
groups such as C(sp2)-CF3, C(sp2)-F, and C(sp2)-N bonds 
is more favorable and more facile with Cu than 
Pd and Ni (Fig. 1) (, 13, 14). As such, the in- 
vention of a mechanistic paradigm that bypasses 
the Cu-haloarene oxidative addition problem 
should enable a range of catalytic cross-coupling 
reactions of scope and utility that has not been 
broadly realized using other forms of metal ca- 
talysis (including Pd, Ni, or Cu). Here, we describe 
a silyl radical/halogen abstraction/copper cap- 
ture mechanism that effectively functions as a 
surrogate for this elementary oxidative addition 
step and, in doing so, allows copper-mediated 
bromoarene functionalization in a general format 
and under mild conditions (visible light and room 
temperature). Moreover, this catalytic strategy has 
been exploited to deliver a general approach to 
the trifluoromethylation of heteroaryl and aryl 
bromides through the combination of copper 
and photoredox catalysis. 

[Fig. 1 graphic labels: aryl halides; sluggish oxidative addition; generally slower than Pd or Ni; often requires weak C-X bonds (C-I) and/or high temperatures; typically rate-determining step.] 

Trifluoromethylarenes are valuable structural 
motifs in the area of drug discovery due to the 
enhancement of desired physicochemical prop- 
erties upon the introduction of the trifluoromethyl 
group (15, 16). Among synthetic strategies, tran- 
sition metal-catalyzed conversion of aryl halides 
to the corresponding trifluoromethylarenes is 
considered one of the most attractive approaches 
(9, 17). Despite extensive effort, nickel catalysis of 
this transformation has not been achieved, and 
only a single report of palladium-catalyzed tri- 
fluoromethylation of aryl chlorides has been dis- 
closed (18). Such diminished catalytic reactivity 
in comparison to other well-known coupling re- 
actions is ascribed to the C(sp2)-CF3 reductive elim- 
ination step, a well-documented challenge for both 
palladium and nickel catalysts (17, 19, 20). In 
contrast, copper complexes have been shown to 
generally undergo facile C(sp2)-CF3 reductive elim- 
ination (21, 22). This realization has propelled 
extensive research efforts to develop copper- 
catalyzed trifluoromethylation protocols with 
haloarenes. Although recent examples of copper- 
mediated (23-25) and copper-catalyzed (26-28) 
methodologies have provided promising results 
for the construction of aryl-CF3 motifs, the lim- 
ited capacity of Cu(I) to participate in oxidative 
addition has restricted the application of these 
elegant systems to mostly aryl iodides or acti- 
vated aryl bromides, often employing elevated 
temperatures. As such, a mechanistic pathway that 
can bypass the copper oxidative addition prob- 
lem, while benefiting from the facility of Cu(III) 
to engage in difficult reductive elimination, might 
enable a mild and broadly general strategy toward 
the trifluoromethylation of aryl bromides. 

Fig. 1. Trifluoromethylation with dual copper/photoredox catalysis. Aryl radical capture by 
copper catalysts can bypass oxidative addition. Ar, (hetero)aryl. 

We recently developed a metallaphotoredox- 
catalyzed (29) cross-electrophile coupling protocol 
in which alkyl radicals were generated by a silyl 
radical-mediated halogen abstraction (30). This 
manifold was shown to provide facile access to a 
wide range of alkyl radicals from the corre- 
sponding alkyl bromides, a feature that has 
been found to be remarkably general. Litera- 
ture precedents indicate that this facile silane- 
mediated halogen abstraction mechanism should 
also function broadly with aryl bromides (k = 5 × 
10⁶ M⁻¹ s⁻¹ to 1.1 × 10⁸ M⁻¹ s⁻¹ for bromobenzene 
with a range of silyl radicals) to generate the 
corresponding aryl radicals under mild condi- 
tions (31, 32). Given that aryl radicals are known 
to be captured by Cu(II) species at a rate ap- 
proaching diffusion (33, 34), we questioned 
whether this light-driven halogen abstraction 
mechanism might be exploited to forge Cu(III)- 
arene intermediates from a diverse range of aryl 
halides at room temperature (Fig. 1), effectively 
overcoming the copper oxidative addition prob- 
lem. The successful execution of this mechanistic 
concept would solve a long-standing challenge in 
copper catalysis, thereafter providing a general 
platform to catalytically access trifluoromethylarenes. 

Our proposed mechanism for the light- 
enabled trifluoromethylation protocol is out- 
lined in Fig. 2A. Upon exposure to visible light, 
photocatalyst Ir(dFFppy)2(4,4'-dCF3bpy)PF6 
[dFFppy = 2-(2,4-difluorophenyl)-5-fluoropyridine; 
4,4'-dCF3bpy = 4,4'-di(trifluoromethyl)-2,2'- 
bipyridyl] (1) should produce excited triplet 
state species 2. This highly oxidizing Ir com- 
plex [E1/2 (*Ir(III)/Ir(II)) = +1.55 V versus saturated 
calomel electrode (SCE) in CH3CN] can undergo 
single-electron transfer (SET) with tris(trimethylsilyl)silanol 
(4) [supersilanol, Ep (4+•/4) = +1.54 V 
versus SCE in CH3CN] to give Ir(II) complex 3 
(35). We hypothesize that supersilanol 4, upon 
oxidation and deprotonation, would then un- 
dergo rapid silyl migration (Fig. 2B) to yield 
species 5 as the catalytically relevant silyl radical 
(36). Subsequent bromine atom abstraction from 
aryl bromide 6 would then yield the correspond- 
ing aryl radical 7. Concurrently, Ir(II) complex 
3 [E1/2 (Ir(III)/Ir(II)) = -0.83 V versus SCE in 
CH3CN] could facilitate the formation of the 
CF3 radical (9) through a SET event with elec- 
trophilic CF3 reagent 8 [Ep (8/8•−) = -0.52 V 
versus SCE in CH3CN]. At this time, we believe 
the Cu(II)-CF3 adduct (11) should be generated 
through interaction between the CF3 radical (9) 
and the active Cu(I) complex (10), formed in situ 
by SET reduction of a Cu(II) catalyst precursor 
by the photocatalyst. Aryl radical 7 should quickly 
undergo radical trapping by Cu(II) complex 11 to 
yield the key aryl-Cu(III)-CF3 adduct 12 (37, 38). 
Reductive elimination from this high-valent Cu(III) 
intermediate would yield the desired product 13 
and simultaneously regenerate Cu(I) catalyst 10 (39). 
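
The thermodynamic feasibility of the two single-electron transfer steps can be estimated from the potentials quoted above. The sketch below uses the standard relation between free energy and potential difference for a one-electron transfer, ΔG = -F(E_acceptor - E_donor). It is an illustration only: the function name and the dictionary are hypothetical, and treating the quoted potentials (some of which are likely peak potentials of irreversible couples) as thermodynamic values is an assumption.

```python
FARADAY_C_PER_MOL = 96485.332  # Faraday constant


def set_free_energy_kj_per_mol(e_acceptor_v, e_donor_v):
    """Free energy of a one-electron transfer from donor to acceptor.

    e_acceptor_v: reduction potential of the couple that gains the electron.
    e_donor_v: reduction potential of the couple that gives up the electron.
    Negative values mean the transfer is thermodynamically downhill.
    """
    return -FARADAY_C_PER_MOL * (e_acceptor_v - e_donor_v) / 1000.0


# Potentials quoted in the text (V versus SCE in CH3CN), as (acceptor, donor).
steps = {
    "excited Ir(III) oxidizes supersilanol 4": (1.55, 1.54),
    "Ir(II) complex 3 reduces CF3 reagent 8": (-0.52, -0.83),
}

if __name__ == "__main__":
    for label, (e_acceptor, e_donor) in steps.items():
        dg = set_free_energy_kj_per_mol(e_acceptor, e_donor)
        print(f"{label}: {dg:.1f} kJ/mol")
```

On these numbers the oxidation of supersilanol is close to thermoneutral, whereas the reduction of reagent 8 by the Ir(II) complex is downhill by about 30 kJ/mol.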

A survey of various combinations of differ- 
ent electrophilic CF3 reagents, copper sources, 
and solvents revealed that blue light-emitting 
diode (LED) irradiation of a mixture of 4- 
bromobenzonitrile, supersilanol (4), dMesSCF3 
(8), CuBr2·2LiBr [20 mole % (mol %)], and Ir(III) 
photocatalyst 1 (0.25 mol %) in acetone at room 
temperature afforded the desired trifluoromethyl- 
arene in 89% yield (Fig. 2C). The use of super- 
silanol in place of hydrosilanes was found to be 
crucial, because it minimizes unproductive proto- 
dehalogenation of the bromoarene, which presum- 
ably proceeds through hydrogen atom abstraction 
by aryl radicals from weak Si-H bonds (37). In 
addition, a variety of diaryl sulfonium CF3 salts 
were found to be compatible with the coupling 
protocol. However, the dimesityl sulfonium tri- 
flate salt (8) provided the desired product with 
the highest efficiency. We attribute this efficiency 
to suppression of unproductive CF3 radical addi- 
tion to the corresponding diaryl sulfide (40). No 
reactivity was observed between sulfonium 8 and 
a variety of Cu(I) and Cu(II) complexes, which 
further supports a photoredox-catalyzed gener- 
ation of a CF3 radical. Electrophilic CF3 reagent 8 
is a bench-stable, crystalline solid, which allows 
for ease in purification and handling (Fig. 2D). 
Moreover, optimization in the preparation proce- 
dure allowed for a one-step, scalable synthesis of 
this now-commercial trifluoromethylating reagent. 

Fig. 2. Reaction design. (A) Proposed mechanism for the 
trifluoromethylation of aryl bromides with dual copper/photoredox 
catalysis. (B) Proposed mechanism for the generation of the 
active silyl radical. (C) Optimized conditions. (D) Reagent dMesSCF3 
and its solid-state structure. Ellipsoids are drawn at 50% 
probability. The triflate counterion is omitted. Me, methyl; LED, 
light-emitting diode; SET, single-electron transfer; Mes, 
2,4,6-trimethylphenyl; OTf, trifluoromethanesulfonate. 

With the optimized conditions in hand, we 
sought to examine the scope of the trifluoro- 
methylation protocol (Fig. 3). A broad range of 
electronically differentiated para-substituted bro- 
moarenes were found to give good to excellent 
yields of the desired product (15 to 20, 78 to 96% 
yield). Functionalities with coupling capability, 
such as chloro and pinacolborato, remain intact, 
potentially allowing for subsequent orthogonal 
functionalization (17 and 19, 96 and 78% yield). 
Substituents at the meta and ortho positions 
were also well tolerated (21 and 22, 81 and 70% 
yield). Without extensive optimization, the 
functionalization of dibromobenzenes provided 
the bis(trifluoromethyl)arene product in good 
yield for the para isomer and in moderate to low 
yields for the ortho and meta isomers (figs. S28 
to S30). The monotrifluoromethylated product 
could be obtained in good yield in all cases 
using our standard 
conditions. Aryl bromides with fused cyclic mo- 
tifs were converted to the desired trifluoro- 
methylarenes in good yields (23 to 26, 75 to 
84% yield). Additional examples of bromoarene 
trifluoromethylation can be found in fig. S31. 
Regarding the scope of heteroaryl bromides, 
pyridine-derived substrates were functionalized 
in good to excellent yields (27 to 36, 65 to 91% 
yield), and the trifluoromethylation of bromo- 
pyrazines, -pyrimidines, and -pyridazines was 
also accomplished in generally high efficiency 
(37 to 42, 64 to 91% yield). We were pleased to 
find that a wide range of five-membered hetero- 
aryl bromides, such as imidazoles, pyrazoles, and 
thiazoles, were competent substrates, providing 
the desired products in useful to good yields (43 
to 50, 38 to 64% yield). Finally, the compatibility 
of the trifluoromethylation protocol with bio- 
relevant molecules was also examined. Indeed, 
CF3-derivatives of Celebrex, Lopid, Skelaxin, and 
Rupatadine were readily synthesized from the 
corresponding aryl bromide precursors (51 to 
54, 42 to 77% yield). 

Fig. 4. Evidence for an aryl radical mechanism. (A) Under copper-free conditions, silane- 
mediated generation of aryl radical was observed through hydrogen/deuterium trapped product and 
TEMPO-trapped adduct. (B) Similar behavior was observed under the optimized reaction condition 
when TEMPO was used as a radical trap. (C) Exclusive cyclized product formation, along with the 
observed diastereoselectivity, served as strong indications for an aryl radical mechanism being 
operative. TEMPO, (2,2,6,6-tetramethylpiperidin-1-yl)oxyl; d.r., diastereomeric ratio. 

We next turned our attention to investigating 
the intermediacy of an aryl radical under this 
dual copper/photoredox platform. First, gener- 
ation of an aryl radical under copper-free con- 
ditions was examined by exposure of a mixture 
of bromoarene (6), supersilanol (4), dMesSCF3 
reagent (8), and photocatalyst (1) to blue LEDs 
(Fig. 4A). When acetone was employed as the 
reaction medium, 66% yield of the protodehalo- 
genation product (55) was observed. This prod- 
uct is proposed to arise from a hydrogen atom 
transfer event between an aryl radical and a 
solvent molecule. When the same reaction was 
carried out in acetone-d6, a similar yield was 
observed with 53% deuterium incorporation at 
the para position (56, 51% yield). Furthermore, 
conducting the identical reaction in the presence 
of 2,2,6,6-tetramethylpiperidin-1-oxyl (TEMPO) 
resulted in complete shutdown of formation of 
the reduced product, and, more important, aryl- 
trapped TEMPO adduct 57 was isolated, further 
supporting the intermediacy of an aryl radical 
(41, 42). A similar result was observed when a 
TEMPO-trapping experiment was carried out 
with the optimized trifluoromethylation con- 
ditions. As shown in Fig. 4B, aryl-TEMPO adduct 
57 was isolated, whereas the formation of the 
desired trifluoromethylarene was completely 
suppressed. Control experiments indicated that 
the aryl radical formation is not mediated by 
direct SET reduction of the aryl bromide by 
the photocatalyst (fig. S41). To lend additional 
support to our proposed mechanism, we next 
employed a radical clock experiment with ortho- 
O-(3-methyl-allyl)bromobenzene (5:1 trans:cis mix- 
ture) (58) as the substrate (Fig. 4C). Subjecting 
the olefin mixture to the standard conditions 
yielded exclusively the cyclized alkyl-CF3 product 
in good yield (59, 60% yield). The 1-to-1 diaster- 
eomeric ratio of the cyclized product 59, along 
with the absence of the CF3-arene product 60, 
further supports the hypothesis of aryl radical 
generation during the course of the reaction 
(33, 34). Previously reported copper-mediated 
aryl halide trifluoromethylation protocols did 
not exhibit aryl radical characteristics when sim- 
ilar radical clock experiments were used as a 
mechanistic probe (24, 25). In addition, whereas 
aryl radical generation has previously been shown 
to be operative under light-irradiated copper ca- 
talysis (33), the open-shell aryl species under 
our optimized conditions is generated by a silyl 
radical halogen abstraction, independent of the 
copper catalyst (43). 

Use of aryl radical trapping to circumvent the 
sluggish nature of conventional oxidative addition 
in copper catalysis has allowed for the devel- 
opment of a general dual copper/photoredox- 
catalyzed trifluoromethylation protocol for aryl 
bromides. We expect this method to be widely 
adopted by the synthetic community as a com- 
plementary tool to existing trifluoromethylation 
platforms. Perhaps most important, we see this 
halide abstraction/radical capture mechanism 
(at room temperature) as an approach to over- 
come the copper oxidative addition problem and 
in doing so provide a general paradigm to the 
development of many, previously unknown, copper- 
catalyzed coupling reactions. 

Feedback regulation of COOLAIR 
expression controls seed dormancy 
and flowering time 


Min Chen and Steven Penfield* 


Plants integrate seasonal signals, including temperature and day length, to optimize the 
timing of developmental transitions. Seasonal sensing requires the activity of two proteins, 
FLOWERING LOCUS C (FLC) and FLOWERING LOCUS T (FT), that control certain 
developmental transitions in plants. During reproductive development, the mother plant 
uses FLC and FT to modulate progeny seed dormancy in response to temperature. We 
found that for regulation of seed dormancy, FLC and FT function in opposite configuration 
to how those same genes control time to flowering. For seed dormancy, FT regulates 
seed dormancy through FLC gene expression and regulates chromatin state by activating 
antisense FLC transcription. Thus, in Arabidopsis the same genes controlled in opposite 
format regulate flowering time and seed dormancy in response to the temperature 
changes that characterize seasons. 


In Arabidopsis, seasonal changes in temper- 
ature are sensed by the epigenetic state of the 
floral repressor gene FLOWERING LOCUS C 
(FLC) (1-3). Prolonged periods of cold pro- 
mote the expression of antisense transcripts 
at FLC known as COOLAIR, which facilitate si- 
lencing of FLC mRNA transcription via recruit- 
ment of Polycomb repressive complex 2 (PRC2) 
and deposition of the trimethylated histone H3 
Lys27 (H3K27me3) chromatin mark (4-6). A key 
target of FLC is FLOWERING LOCUS T (FT), and 
FT and FLC are believed to act sequentially to 
deliver seasonal information that promotes flow- 
ering (7-9). 

After flowering, FLC and FT control plant ar- 
chitecture and progeny seed dormancy, the latter 
by regulation of seed coat development and seed 
hormone levels (10-15). Temperature changes 
applied to plants before buds are visible affect 
progeny seed dormancy in a process that requires 
maternal FLC and FT, which pass seasonal in- 
formation to progeny (12). In this way, the mother 
plant exploits environmental temperature vari- 
ation to create diversity in progeny seed behavior, 
which is important for bet-hedging reproductive 
strategies (15). Here, we investigated how FT 
controls seed dormancy. 

To repress FT expression, FLC forms heter- 
odimers with the transcription factor SHORT 
VEGETATIVE PHASE (SVP) (16, 17). To determine 
the genetic architecture of maternal temperature 
signaling pathways controlling seed dormancy, 
we exposed ft-1 svp-3 and ft-1 flc-21 double mutant 
rosettes to high and low ambient temperatures 
and analyzed the effect on progeny dormancy 
relative to the wild type and single mutants 
(Fig. 1A). Restricting temperature treatments 
to the pre-anthesis period enables dormancy 
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outcomes to be ascribed solely to parental re- 
sponses (12). Genetic interactions showed that 
the effect of SVP requires FT, because mutation 
of FT suppressed the high germination of sup-3 
mutants (Fig. 1A). However, ft-1 flc-21 double 
mutant plants produced seeds whose germina- 
tion resembled that of flc-21, and jflc-21 reversed 
the stronger dormancy of ft-7 mutants (Fig. 1A). 
Therefore, in the context of seed dormancy, the 
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FLC locus acts genetically downstream of FT. 
To test whether FT requires maternal FLC for 
dormancy control, we made reciprocal crosses 
between jt-1 and ft-1 flc-21 double mutants. Ma- 
ternal FLC was required for seed dormancy 
control by FT (fig. SIA) (JO). FT is required for 
temperature regulation of seed coat development 
(12). Indeed, flc-21 prevented increases in seed 
coat tannin content and decreases in seed coat 
permeability observed in ft mutant fruit tissue, 
and reversed the up-regulation of transcription 
factors controlling seed coat tannin deposition 
in ft-1 (Fig. 1, B and C, and fig. S1, B and C). 
Next, we tested whether FT controls FLC ex- 
pression, focusing our analysis on female repro- 
ductive tissues. Relative to 16°C conditions, warm 
conditions (22°C) down-regulated FLC mRNA 
expression in wild-type carpels, as it does in 
leaves (18). Loss of FT strongly reduced the mag- 
nitude of this response (Fig. 2A). To locate the 
tissues in which FT regulates FLC, we crossed 
a translational GUS fusion containing the entire 
FLC locus (3) (FLC-GUS) into ft-1 and analyzed 
the expression in reproductive tissues. In flowers 
at 22°C, FLC is expressed only in the stamen of 
wild-type plants, but in the ft-1 mutant we observed
ectopic expression of FLC-GUS in carpels
(Fig. 2, B and C). We also tested the effect of 
ft-1 on antisense COOLAIR expression in carpels 
(Fig. 2D). COOLAIR is transcribed into both short 
(proximal) and longer (distal) transcripts (19). We 
found that at 22°C, levels of distal COOLAIR 
were reduced in ft-1 but levels of proximal 
COOLAIR transcript were unchanged (Fig. 2D). 

Fig. 1. FT acts upstream of FLC to regulate seed dormancy. (A) Germination of seeds from
mother plants subjected to either 22° or 16°C before anthesis. Error bars represent SE
(n = 5 biological replicates). (B) Insoluble tannin content in wild-type (Ler), ft-1, flc-21,
and ft-1 flc-21 double mutant seeds. Error bars represent SE (n = 5 biological replicates).
(C) Quantitative reverse transcription PCR (qRT-PCR) shows that FLC is required for up-regulation
of regulators of tannin synthesis TRANSPARENT TESTA 12 (TT12) and TT16 in ft-1. Error bars
represent SE (n = 3 biological replicates). *P < 0.05 (difference from wild type).

Therefore, FT influences the balance of sense
and antisense gene expression at the FLC locus. 
Further analysis of FLC:GUS in seed tissues dur- 
ing seed maturation showed no effect of FT on 
FLC:GUS expression in seeds themselves (fig. S2), 
consistent with the maternal action of FLC in 
seed dormancy control; this finding suggests that 
FT and FLC act to control dormancy either dur- 
ing carpel development or shortly after fertilization. 

Because of the differences in FLC expression 
between wild-type and ft-1 mutants, we used FT 
tagged with green fluorescent protein (GFP) to 
test whether FLC is a direct target of FT. Plants 
expressing FT-GFP from the phloem-specific SUC2 
promoter (9) were used to determine whether 
FT-GFP could be detected at the FLC locus by 
chromatin immunoprecipitation (ChIP). This construct is functional and
complements the late-flowering ft-1 mutation, and also increases germination
of ft-1 seeds (fig. S3). We could detect enrichment of FT-GFP toward the last
exon and 3' region of the FLC locus, corresponding to the promoter and first
exon of COOLAIR transcripts (Fig. 3A) (4, 5). Given that COOLAIR transcripts
silence FLC mRNA expression (5), we asked whether FT affects the
temperature-regulation of FLC mRNA levels by promoting COOLAIR transcription.
We used strand-specific quantitative reverse transcription polymerase chain
reaction (qRT-PCR) to compare COOLAIR expression in wild-type and ft-1
siliques produced on plants that began reproductive development at either 22°
or 16°C. We found no evidence that either temperature or FT regulated proximal
COOLAIR transcripts (Fig. 3B). However, 22°C increased distal COOLAIR levels
in wild-type fruits compared to 16°C, and this effect required FT (Fig. 3C).
This down-regulation of distal COOLAIR in ft-1 suggested that this transcript
opposed FLC mRNA expression (20), which is higher in the ft-1 mutant than in
the wild type (Fig. 2).

Fig. 2. Maternal temperature experience and FT affect FLC expression in
carpels. (A) FLC mRNA transcripts in carpel tissue harvested from plants grown
at either 16° or 22°C. Error bars represent SE (n = 3 biological replicates).
(B) FLC-GUS expression in the stage 12 floral organs of representative Ler and
ft-1 flowers. Scale bar, 1 mm. (C) Quantitative measurement of FLC-GUS
activity in wild-type and ft-1 isolated carpels at 22°C. Error bars represent
SE (n = 3 biological replicates). (D) COOLAIR transcripts in wild-type and
ft-1 carpels at 22°C. Error bars represent SE (n = 3 biological replicates).
*P < 0.05 (difference from wild type).

We next tested whether the FT protein can activate transcription from an
isolated COOLAIR promoter fused to the LUCIFERASE (LUC) reporter. After
introgressing the COOLAIR:LUC transgene from Ler to ft-1, we compared LUC
activity between wild-type plants and ft-1 mutants. In the wild type, we could
detect COOLAIR:LUC activity in whole plants, carpels, and fruits, with LUC
activity in fruits declining during fruit and seed maturation (Fig. 3D). In
ft-1 plants, COOLAIR:LUC

Fig. 3. Maternal temperature experience affects FLC antisense
transcription in fruit tissue. (A) FT-GFP ChIP performed across the FLC
locus in silique tissues. Data are represented relative to the negative control
ACTIN2. Numbers indicate positions of primers used. Error bars represent
SE (n = 3 biological replicates). *P < 0.05 (difference from wild type).
(B and C) Proximal and distal COOLAIR levels in silique tissues from plants
growing at either 16° or 22°C. Error bars represent SE (n = 3 biological
replicates). Primers used are indicated by arrows in (A). *P < 0.05
(difference from wild type). (D) COOLAIR:LUC expression imaged in wild-
type and ft-1 seedlings, carpels, and developing siliques from 3 to 14 days
after pollination (DAP; left to right). Scale bars, 5 mm (seedlings and
siliques), 1 mm (carpels). (E) Quantitative analysis of COOLAIR:LUC
expression at 22°C in the indicated organs in long days, and the effect of
16-hour days (LD) and 8-hour days (SD) on COOLAIR:LUC expression in
Ler seedlings. Data are means ± SE (n = 5 biological replicates). P values
were calculated by two-tailed Student t test between genotypes or treatments.

Fig. 4. FT affects FLC chromatin state in leaves and siliques. (A) ChIP
comparing Ler and ft-1 levels of H3K27me3 and H3K36me3 across the FLC
locus in silique and leaf tissues. Data are means ± SE (n = 2 biological
replicates). (B) The effect of vernalization (V) at 4°C for the indicated
number of weeks (W) on distal COOLAIR transcript levels in Ler and ft-1.
Data are means ± SE (n = 3 biological replicates). *P < 0.05 (difference
from Ler control levels without vernalization). (C) GUS activity from
FLC:GUS-expressing Ler and ft-1 plants in seedlings without vernalization
(-V) versus 7 days after vernalization for 4 weeks (4W V+7). Data are means
± SE (n = 3 biological replicates). *P < 0.05 (difference from Ler control
levels without vernalization). (D) Rosette and cauline leaf numbers at
flowering in Ler and ft-1 plants grown at either 22° or 16°C with or without
prior vernalization. Data are means ± SE (n = 5 biological replicates).
Significant differences in response to vernalization are indicated.


expression was reduced to 20 to 30% of wild- 
type levels in all organs tested (Fig. 3, D and E). 
Moving wild-type COOLAIR:LUC plants from 
long days to short days reduced LUC expression 
(Fig. 3E), consistent with the known role of FT 
in photoperiod responses (9). Because FT bound 
the FLC locus close to the COOLAIR transcrip- 
tion start site, and because FT was required for 
COOLAIR promoter activity in plants grown at 
22°C in long days, we conclude that FT sup- 
presses FLC mRNA expression by activating 
COOLAIR transcription. 

During vernalization, changes in COOLAIR 
expression precede stable chromatin state changes 
at FLC (4, 5). To test whether FT regulation of 
FLC is chromatin-based, we analyzed histone 
methylation at the FLC locus in wild-type and 
ft-1 plants at 22°C. Wild-type plants showed more 
H3K27me3 at the nucleation region (6) in siliques 
and leaves and less H3K36me3 than did ft-1 plants 
(Fig. 4A). This shows that FT can switch FLC be- 
tween active and passive transcription states. The 
effect of ft-1 on FLC expression in leaves could be 
overcome by vernalization (Fig. 4, B and C). We 
also found that an FLC-dependent effect of FT 
on flowering time can be observed if plants are 
grown at 16°C (Fig. 4D). The ft-1 mutant is later-flowering
at 16°C than at 22°C, and this effect
can also be overcome by vernalization. This is
reminiscent of fca and fy mutants, which are also 
affected in COOLAIR processing (21). Therefore, 
the feedback regulation of FLC by FT has a gen- 
eral relevance, controlling both flowering time 
and seed dormancy processes. 

Predator-driven natural selection on risk-taking behavior in anole lizards

Oriol Lapiedra, Thomas W. Schoener, Manuel Leal, Jonathan B. Losos, Jason J. Kolbe


Biologists have long debated the role of behavior in evolution, yet understanding of its role 
as a driver of adaptation is hampered by the scarcity of experimental studies of natural 
selection on behavior in nature. After showing that individual Anolis sagrei lizards vary 
consistently in risk-taking behaviors, we experimentally established populations on eight 
small islands either with or without Leiocephalus carinatus, a major ground predator.
We found that selection predictably favors different risk-taking behaviors under different
treatments: Exploratory behavior is favored in the absence of predators, whereas
avoidance of the ground is favored in their presence. On predator islands, selection
on behavior is stronger than selection on morphology, whereas the opposite holds on
islands without predators. Our field experiment demonstrates that selection can shape 
behavioral traits, paving the way toward adaptation to varying environmental contexts. 


Understanding the role of behavior in adaptation
of animals to new environmental
circumstances remains a major challenge 
in biology. Research has long addressed 
the debate about whether behavior spurs 
or impedes evolution (1-3) on phenotypic dimensions
such as morphology (4, 5) or physiology
(6). To unravel the process by which behavior 
shapes adaptation, we must examine how nat- 
ural selection operates among individuals in a 
population (7, 8). Recent growth in the study 
of interindividual variation in behavior (9-11) 
has revealed that behavior often varies consist- 
ently among individuals within a population 
(12). These studies have also suggested that 
this variation has fitness consequences (13-16). 
These results set the stage to investigate the 
hypothesis that natural selection on inter- 
individual variation in behavior could drive dif- 
ferent ecological and evolutionary trajectories 
for populations under distinct selective regimes 
(7, 8, 17-20). Assessing a hypothesis like this 
one under natural conditions requires controlled 
experiments in which natural selection is quanti- 
fied under contrasting selective regimes generated 
by manipulating well-known selective pressures 
(21). Here, we used small Caribbean islands as 
replicates to test directly whether and how nat- 
ural selection operates on lizards with different 
behaviors and morphologies under different 
selective regimes. 
We conducted this experiment on a well-studied predator-prey system involving
the small lizard Anolis sagrei, commonly found on or near the ground (22, 23),
and its ground-dwelling predator, the larger lizard Leiocephalus carinatus
(24) (Fig. 1A). We focused on individual variation in two behaviors of
A. sagrei (Fig. 1B) that are consistently repeatable across time and in
different contexts within individuals of this species (25) (see repeatability
scores from this study in table S1). Specifically, we measured the rapidity of
individuals to explore new and potentially dangerous environments and the time
individuals spend on the ground and are thereby potentially exposed to
ground-dwelling predators (26). The ecological relevance of these risk-taking
behaviors in A. sagrei is illustrated by a simple cost-benefit trade-off
(27, 28). Anolis sagrei individuals more willing to explore new environments
should survive better in the absence of terrestrial predators (17) because
they are more likely to obtain resources. In contrast, A. sagrei individuals
that spend more time exposed on the ground are more vulnerable to ground
predators as compared with individuals that spend less time exposed on the
ground (22, 25, 29). Previous studies have reported differences in habitat use
and modulation of social signals in A. sagrei populations in the presence or
absence of L. carinatus (30, 31), leading us to hypothesize that variation in
risk-taking behavior might be adaptive.

To experimentally examine natural selection on these risk-taking behaviors
under natural conditions, we translocated 273 adult A. sagrei individuals onto
eight small islands in the Bahamas (fig. S2). Lizards were captured from
source islands in the study area that generally have higher vegetation and
host more complex biological communities (32) than our experimental islands,
which have scrubbier, shorter vegetation and do not support resident
populations of any known lizard predator (see table S2).

Fig. 1. Assessment of risk-taking behavior and morphological characterization
of A. sagrei individuals. (A) Anolis sagrei (left) and Leiocephalus carinatus
(right) photographed on the experimental islands. (B) Experimental assessment
of behavioral traits (26). Following (25), an A. sagrei individual was gently
placed into a wooden refuge inside a butterfly cage. During a 3-min
habituation period, we placed a clear plastic cage that contained a live adult
L. carinatus between the refuge and a natural perch. Then, we remotely opened
the door of the refuge and the A. sagrei individual was able to see the
predator for 5 min (1). At the end of this period, we closed the door of the
refuge and removed the plastic container with the L. carinatus from the
experimental cage. After another 5-min habituation period, we again opened the
refuge cover and measured the “time to initiation of exploration in a new
environment” (2), defined as the time interval between the time we opened the
refuge cover and the time when the lizard started exploring the experimental
cage by poking its head out of the refuge. We defined “time exposed on the
ground” as the interval between the “exposed time start” (3), defined as the
time when the experimental lizard went out of the refuge (i.e., all its body,
excluding the tail), and the “exposed time end” (4), the time when the lizard
either climbed the perch or hid underneath the rocks. See (26) for details.
(C) Example of an x-ray image from which we measured the morphological traits
in this study (i.e., SVL and hindlimb length).

Before translocation, we used outdoor laboratory behavioral assays
[following (25, 26)] to
characterize interindividual variation in two behaviors
known to consistently vary among individuals
(25) (table S1). After being exposed to the
presence of an individual L. carinatus (position
1 in Fig. 1B), “time to initiation of exploration in a
new environment” was defined as the amount of 
time until the lizard started exploring the experi- 
mental cage by poking its head out of the wooden 
refuge (position 2 in Fig. 1B). “Time exposed on 
the ground” corresponded to the interval of time 
during which the lizard was out of the refuge 
(position 3 in Fig. 1B) until it climbed on the perch 
or hid underneath the rocks (position 4 in Fig. 1B). 
Each lizard was x-rayed (Fig. 1C) and individually 
tagged before translocation onto experimental 
islands. We randomly assigned individuals to 
islands. Each island received lizards in propor- 
tion to its vegetated area, which was determined 
by conducting vegetation transects [following 
(26, 32)]. One week later, we added L. carinatus 
on four randomly selected islands, while the 
other four islands remained as predator-free 
controls. Four months later, we recaptured liz- 
ards on each of the experimental islands and 
identified surviving adult lizards from their in- 
dividually unique subcutaneous tags. 

On the basis of recapture data, we found that survival was lower on predator
islands than on predator-free islands (mixed-effects model including island ID
as a random factor and modeled following the binomial distribution; P < 0.001;
fig. S3A). We also observed that A. sagrei from predator islands used the
ground less frequently (16.9% of observations) than those from predator-free
islands (41.4% of observations), and mean perch height was more than twice as
high on predator islands (33.9 cm) as on predator-free islands (14.4 cm)
(t = -4.9, df = 102.5, P < 0.001; fig. S3B).

Fig. 2. Association between individual variation in behavior and survival of
A. sagrei females after the 4-month experimental period. (A and B) Time to
initiation of exploration in a new environment (A) and time exposed on the
ground (B) are represented separately for predator-free versus predator
islands. Solid lines represent the fitted model logistic regression; dashed
lines denote 95% confidence intervals. Dots represent individual values for
both survivors (dots at top of each panel) and nonsurvivors (dots at bottom of
each panel); darker dots are indicative of several individuals having similar
values. See table S4 for a representation of results pooling both sexes
together.

Because A. sagrei is a sexually dimorphic spe- 
cies in which males and females differ in both 
morphology and behavior (23, 33) (figs. S8 and S9), 
we hypothesized that natural selection on inter- 
individual variation in behavior could operate 
differently between sexes under different envi- 
ronmental conditions. On predator-free islands, 
natural selection favored females that took less 
time to initiation of exploration in the exper- 
imental trials conducted before release—a pat- 
tern not observed on predator islands (Fig. 2A). 
On predator islands, females that spent less time 
exposed on the ground had a greater chance of 
survival (Fig. 2B). Behavior was not a significant 
predictor of survival for males (fig. S3A). Whether 
or not A. sagrei were initially captured from is- 
lands with L. carinatus present did not signifi- 
cantly affect their chances of survival during the 
experiment (table S3). 
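
The logistic fits in Fig. 2 relate a behavioral score measured before release to a binary survival outcome. A minimal sketch of that kind of fit follows, assuming a single predictor and no island random effect (the study itself reports mixed-effects models). The function names and the toy data are hypothetical and are not values from the study.

```python
import math

def fit_logistic(x, y, iterations=25):
    """Fit P(y = 1) = 1 / (1 + exp(-(b0 + b1 * x))) by Newton-Raphson.

    x: predictor values (e.g. seconds exposed on the ground, hypothetical)
    y: 0/1 outcomes (1 = recaptured alive)
    """
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = 0.0            # gradient of the log-likelihood
        h00 = h01 = h11 = 0.0    # negative Hessian (information matrix)
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p
            g1 += (yi - p) * xi
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        if det == 0.0:
            break
        # Newton step: solve the 2x2 system (information matrix) * delta = gradient
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

def survival_probability(b0, b1, xi):
    """Predicted survival probability at predictor value xi."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))

# Hypothetical toy data: longer exposure tends to go with non-survival.
exposure_s = [5, 10, 15, 20, 30, 40, 50, 60, 80, 100]
survived = [1, 1, 0, 1, 1, 0, 1, 0, 0, 0]
b0, b1 = fit_logistic(exposure_s, survived)
print("slope:", b1)
print("P(survive | 10 s):", survival_probability(b0, b1, 10))
```

A negative slope in such a fit corresponds to the pattern described for predator islands, where more time exposed on the ground goes with lower survival. The sketch fails on perfectly separable data, where no finite maximum-likelihood estimate exists.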

That we only found significant selection on 
time spent on the ground on predator islands 
for females, but not males, suggests a greater 
effect from predatory lizards on females than 
on males. In support of this possibility, female
mortality was higher on predator islands than
on predator-free ones (χ² = 9.7, P = 0.002),
whereas for males there was no difference (χ² =
2.9, P = 0.086; Fig. 3A). In addition, on predator
islands, use of the ground was also lower in females
than in males (11.9% versus 22.9%, respectively;
χ² = 41.9, P < 0.001; Fig. 3B). Because
A. sagrei feeds primarily on the ground (23), the 
observed patterns of ground use suggest that fe- 
males could be having more difficulties obtaining 
food resources on the predator islands. Indeed, 
4 months after experimental translocation, fe- 
males were in poorer body condition on pred- 
ator islands than on predator-free islands (P < 
0.001), a pattern not observed in males (P = 0.68) 
(fig. S3). Together, these results suggest that dif- 
ferences in habitat use between sexes influence 
natural selection on behavioral traits. 
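
The chi-square comparisons above can be read against their P values with a short calculation. The sketch below assumes each test has one degree of freedom (a 2 x 2 table such as treatment by survival within one sex) and no continuity correction; neither detail is stated in this section. The function names are hypothetical, and the 2 x 2 counts in the usage line are invented for illustration.

```python
import math

def chi2_2x2(a, b, c, d):
    """Pearson chi-square statistic for the table [[a, b], [c, d]], uncorrected."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    stat = 0.0
    for observed, row, col in ((a, row1, col1), (b, row1, col2),
                               (c, row2, col1), (d, row2, col2)):
        expected = row * col / n
        stat += (observed - expected) ** 2 / expected
    return stat

def p_value_df1(stat):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom.

    With 1 df the statistic is a squared standard normal, so
    P(X > stat) = erfc(sqrt(stat / 2)).
    """
    return math.erfc(math.sqrt(stat / 2.0))

# Reported statistics from the text, converted under the 1-df assumption.
print(p_value_df1(9.7))    # roughly 0.002
print(p_value_df1(2.9))    # roughly 0.09
print(p_value_df1(41.9))   # far below 0.001

# Invented counts, only to show the statistic itself being computed.
print(chi2_2x2(10, 20, 20, 10))
```

Under these assumptions the first and third values agree with the reported P = 0.002 and P < 0.001. The second comes out slightly above the reported 0.086, a gap that rounding of the statistic to 2.9 would be enough to explain.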

A long-standing debate in evolutionary biology concerns the association
between behavioral and morphological evolution (1, 2, 34). Our study design
allowed us to investigate whether selection on morphological traits occurs
simultaneously with selection on behavioral traits and to assess whether
selection on both phenotypic dimensions was correlated. Specifically, we
tested a well-established morphological pattern in Anolis lizards: that the
use of the ground or other broad surfaces favors longer limbs, which provide
greater sprinting abilities [reviewed in (23)]. We found that females with
longer hindlimbs relative to their body size survived better than
shorter-limbed individuals on predator-free islands (P = 0.002; Table 1 and
fig. S6). This is consistent with our observation that females used the ground
more often on predator-free islands than on predator islands (Fig. 3B). On
predator islands, relative hindlimb length did not affect survival (P = 0.26;
fig. S6). We did not find selection on the relative hindlimb length for males
(P > 0.80) in either experimental treatment. In addition, we found that on
predator islands, smaller females survived better than larger females
(P = 0.013; Table 1). Finally, selection on behavior and morphology was not
correlated. For females from predator-free islands, selection for longer
hindlimbs was independent of selection for increased exploratory behavior
(shown by the lack of a significant interaction term in mixed models shown in
Table 1). On predator islands, selection for smaller females was also
independent of selection favoring individuals that spent less time exposed on
the ground (Table 1). Overall, these results indicate that natural selection
on behavior can occur simultaneously with, and independent of, selection on
morphology.

Fig. 3. Comparison of survival frequencies and habitat use between sexes and
experimental treatments. (A) The proportion of females surviving was higher on
predator-free islands than on predator islands, but this difference was
marginally nonsignificant for males. Error bars indicate SEM. (B) Both sexes
used the ground less on predator islands, but this difference was greater for
females than for males.

Table 1. Best mixed-effects models describing female survival on the
experimental islands. Columns: Estimate, SE, z, P value. [Table body not
recoverable from the extraction. Surviving labels: Predator-free islands
(n = 63); Body size (SVL); Random effects: Island. Legible values 0.14 and
-2.48 survive without row labels.]

Given that selection operated on both morphology and behavior, we asked which
of these
factors explained a higher proportion of the var- 
iation in mortality in females (no significant fac- 
tors were detected in males). An analysis of the 
proportion of variation in mortality explained by 
behavior versus morphology (26) revealed that 
on predator-free islands, selection on hindlimb 
length explained 19.1% of the variation in mor- 
tality, whereas selection for more exploratory 
females accounted for 13.9%. Conversely, on 
predator islands, the proportion of variance 
in mortality explained by time exposed on the 
ground was 22.5%, whereas body size [snout- 
vent length (SVL)] accounted for 9.8%. These 
findings suggest that although both behavior 
and morphology can simultaneously contribute 
to survival, their importance is context-dependent 
and varies under different selective regimes. 
Although behavior largely defines how ani- 
mals interact with the environment, the evolu- 
tionary consequences of interindividual variation 
in behavior remain largely unknown (7, 8). Our 
replicated field study provides evidence that 
natural selection operates differently on inter- 
individual variation in behavior under different, 
experimentally manipulated selective pressures. 
Moreover, our results indicate that differences 
in habitat use between sexes likely influence the 
strength of natural selection on behavioral traits. 
By showing that selection can simultaneously
and independently operate on behavior and mor- 
phology, we demonstrate that rapid environmental 
changes can shape different phenotypic dimen- 
sions at the same time; the evolutionary outcome 
of such selection will depend on the genetic basis 
of these traits and the extent to which they are 
correlated. Our results thus underscore the need 
to explicitly integrate interindividual variation in 
behavior as a relevant phenotypic dimension in 
studies of adaptation (7, 8, 35). Moreover, we 
found that under increased predation pressure, 
behavior is a more important factor explaining 
survival than the morphological traits that have 
been the subject of previous investigation (22); 
the extent to which these results can be gener- 
alized across species remains to be determined. 
Our results demonstrate that consistent behav- 
ioral variation among individuals can be an im- 
portant focus of selection when populations 
experience novel environmental conditions—an 
increasingly common situation in the current 
context of global change. 

The effect of partisanship and political advertising on close family ties

M. Keith Chen and Ryne Rohla


Research on growing American political polarization and antipathy primarily studies public 
institutions and political processes, ignoring private effects, including strained family 
ties. Using anonymized smartphone-location data and precinct-level voting, we show that 
Thanksgiving dinners attended by residents from opposing-party precincts were 30 to
50 minutes shorter than same-party dinners. This decline from a mean of 257 minutes
survives extensive spatial and demographic controls. Reductions in the duration of 
Thanksgiving dinner in 2016 tripled for travelers from media markets with heavy political 
advertising—an effect not observed in 2015—implying a relationship to election-related 
behavior. Effects appear asymmetric: Although fewer Democratic-precinct residents 
traveled in 2016 than in 2015, Republican-precinct residents shortened their Thanksgiving 
dinners by more minutes in response to political differences. Nationwide, 34 million hours of 
cross-partisan Thanksgiving dinner discourse were lost in 2016 owing to partisan effects. 


American political partisanship has risen sharply over the past 25 years. More
than 55% of Democrats and Republicans described “very unfavorable” feelings
toward the opposing party in 2016, up from 17 to 21% in the mid-1990s; growing
numbers of Independents express disfavor with both parties, and rising party
defections increase polarization (1). Spatial partisan sorting produces
increasingly homogeneous electoral “bubbles” at both state and local levels
(2), and political minorities within these bubbles show reticence to
participate in or reveal their party affiliation (3).

Animosity toward political rivals is not lim- 
ited to the ballot box; implicit partisan biases 
manifest in discriminatory decisions even more 
frequently than racial or gender biases (4). Par- 
ents express intolerance of their children dating 
and marrying across partisan lines (5), and ob- 
served dating and marital choices segregate more 
strongly on politics than on physical attributes 
or personality characteristics (6). Political polar- 
ization affects decisions, such as where to work 
and shop, at higher rates than race, ethnicity, or 
religion (7). 

We study whether politics strain close family 
ties by measuring family-gathering durations. 
After the historically divisive 2016 presidential 
election, 39% of American families avoided po- 
litical conversations during the holidays, an aver- 
sion that spanned both party and socioeconomic 
lines (8). We examine Thanksgiving, which, in 
U.S. election years, may bring together family 
members with differing political views just weeks 
after votes are cast. Anecdotal evidence suggests
that, in the wake of the 2016 election, many fam- 
ilies canceled or otherwise cut short Thanksgiv- 
ing plans with their most politically problematic 
relatives (9). 

Several cognitive biases in social and political 
psychology explain why individuals might limit 
such interactions. A “partisan selective exposure” 
motivation occurs when individuals avoid counter- 
attitudinal political information that might engen- 
der cognitive dissonance or harm relationships 
(10). Numerous studies find “belief polarization,”
whereby individuals gravitate toward more ex- 
treme versions of their own initial positions 
during discussion of political issues (11). Exacerbating
this effect, individuals also incorrectly
expect others to respond to discussion and de- 
bate in the same direction as their own response, 
anticipating belief convergence rather than polar- 
ization (12), and attribute a lack of convergence to 
the bias and irrationality of others, while viewing 
themselves and copartisans as less ideological 
than cross-partisans (13). Our study examines 
whether these effects, which are well-studied 
in experimental settings among strangers, extend 
to close family gatherings. 

We analyze how political differences affect 
the duration of Thanksgiving dinner by merging 
two datasets. Anonymized smartphone-location 
data from more than 10 million Americans allow 
observation of actual travel at extremely precise 
spatial and temporal levels. We combine this 
with a precinct-level database for the 2016 elec- 
tion to impute presidential voting at the finest 
spatial resolution possible. By comparing vote 
shares in an individual’s home and Thanks- 
giving destination precincts, we test the relation- 
ship between political disagreement and time 
expenditure. 
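
One way to picture the comparison of home and destination vote shares is as a mismatch probability. The sketch below is an illustration only: it assumes that a traveler and a host each vote like a randomly drawn two-party voter from their own precinct, which is a simplification introduced here and not the imputation procedure of the study. The function name and the example shares are hypothetical.

```python
def mismatch_probability(home_dem_share, dest_dem_share):
    """Probability that traveler and host voted for different parties.

    Each argument is a precinct's two-party Democratic vote share in [0, 1].
    Assumes independent random draws from each precinct (an illustrative
    simplification).
    """
    for share in (home_dem_share, dest_dem_share):
        if not 0.0 <= share <= 1.0:
            raise ValueError("vote shares must lie between 0 and 1")
    return (home_dem_share * (1.0 - dest_dem_share)
            + (1.0 - home_dem_share) * dest_dem_share)

# Hypothetical precinct pairs.
print(mismatch_probability(0.8, 0.2))  # strongly opposed precincts
print(mismatch_probability(0.7, 0.7))  # same-leaning precincts
```

A value near 1 marks a trip between strongly opposed precincts. Two precincts with the same lean still produce a nonzero value, because neither precinct is politically uniform.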

To isolate the particular effect of election-year 
political partisanship from a multitude of demographic
and spatial confounds, we construct
comparison sets of smartphone users that share 
the same home-destination pairs. Our measured 
effects are neither eliminated nor attenuated by 
comparing only matched users, suggesting that 
the measured time loss is not an artifact of po- 
litically correlated demographics or spatial sort- 
ing. Furthermore, because political advertising 
polarizes opinions (14) and heightens dislike for
opposing parties (15), we compare partisan rifts 
between comparable users who fall just on op- 
posite sides of media-market boundaries. Account- 
ing for political advertising more than tripled 
our measured “Thanksgiving effect” in 2016, but 
not in 2015, before ads were run. This noneffect 
of yet-to-be-run ads acts as a political placebo 
test, further bolstering the argument that our 
measured Thanksgiving losses stem from polit- 
ical partisanship rather than from preexisting 
demographic or spatial confounds. 

We collect precinct-level results for the 2016 
presidential election through internet scrap- 
ing and by contacting secretaries of state, boards 
of election, and individual county clerks via 
email, phone, or fax or in person. Finally, we 
match vote totals to precinct polygonal shape- 
files using Geographic Information Systems (GIS) 
software. The resulting dataset covers 172,098 
precincts across 99.9% of counties nationally 
(Fig. 1A). 

Political advertising data are from Kantar 
Media’s Campaign Media Analysis Group (16)
and count every U.S. presidential television ad 
aired in all 210 Nielsen Designated Market Areas 
after 12 June 2016, including ads purchased di- 
rectly by campaigns or outside groups such as 
political action committees. Data from the 2010 
Decennial Census and the Census Bureau’s 2012- 
2015 American Community Survey form the basis 
of demographic controls. 

Location data rely on numerous smartphone 
apps and were aggregated by SafeGraph, a com- 
pany that builds and maintains anonymized 
geospatial datasets for more than 10 million 
U.S. smartphones. These data consist of “pings,” 
each identifying the coordinates of a particular 
smartphone at a moment in time. Our primary 
analysis includes 21 billion pings from November 
2016 and 4.5 billion from November 2015. 

To merge datasets, we infer the precinct and 
census block of each smartphone user’s “home” 
on the basis of that user’s pings between 1:00 a.m. 
and 4:00 a.m. over the 3 weeks before Thanks- 
giving. This procedure identifies more than 6 
million approximate home locations in November 
2016 (Fig. 1B), which we then link with precinct- 
level two-party vote shares and census demograph- 
ics. Similarly, a user’s Thanksgiving location is 
based on their modal location between 1:00 p.m. 
and 5:00 p.m. (24 November 2016 and 26 November 
2015). 
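
The location-inference step above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the ping format `(timestamp, lat, lon)`, the helper name `modal_cell`, and the rounding of coordinates to form cells are all assumptions made for the example, and the use of a modal cell for the home location is inferred from the word "Similarly" in the text.

```python
from collections import Counter
from datetime import datetime

def modal_cell(pings, start_hour, end_hour, precision=3):
    """Return the most frequent rounded (lat, lon) cell among pings whose
    local hour falls in [start_hour, end_hour), or None if there are none.
    Rounding coordinates stands in for a real spatial index (assumption)."""
    cells = Counter(
        (round(lat, precision), round(lon, precision))
        for ts, lat, lon in pings
        if start_hour <= ts.hour < end_hour
    )
    if not cells:
        return None
    return cells.most_common(1)[0][0]

# Hypothetical pings for one device: (timestamp, latitude, longitude).
pings = [
    (datetime(2016, 11, 10, 2, 15), 33.5779, -101.8552),
    (datetime(2016, 11, 11, 3, 40), 33.5779, -101.8552),
    (datetime(2016, 11, 12, 1, 5), 33.6000, -101.9000),
    (datetime(2016, 11, 24, 14, 0), 40.0000, -75.0000),
]

home = modal_cell(pings, 1, 4)          # 1:00 a.m. to 4:00 a.m. window
thanksgiving = modal_cell(pings, 13, 17)  # 1:00 p.m. to 5:00 p.m. window
```

In a real analysis the cell would then be mapped to a precinct and census block; that lookup is omitted here.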

By construction, this sample is representative 
of the 77% of Americans who own smartphones, 
raising the question of whether our sample is 
politically representative of the American elec- 
torate as a whole. We test this by assigning to 
each resident a vote ratio proportional to the 
Fig. 1. Sampling and imputation validation. 

(A) Results of the 2016 U.S. presidential election by 
precinct (excludes unpopulated census blocks). 

(B) Home locations of smartphone users in the 2016 
sample. (C) Correlation between actual two-party 
vote share by state and the District of Columbia (DC) 
(x axis) and predicted vote share (y axis) using each 
smartphone user’s home precinct. Nationally, this 
predicts a 0.516 Clinton vote share, compared to an 
actual vote share of 0.511. Highlighted are the two 
most Democratic-leaning states (California and 
Massachusetts) and two most Republican-leaning 
states (Wyoming and West Virginia), as well as the 
states with the largest prediction error. 


Table 1. Effect of political mismatch on Thanksgiving dinner duration. Each regression (column) estimates the effect of voting disagreement on 2016 
Thanksgiving dinner duration. All results use linear regressions with fixed effects controlling for an individual’s home location—cross—Thanksgiving 
destination. Stepwise regressions control for progressively finer pairs, culminating in a five-digit geohash, a square grid about 3 miles by 3 miles in size. The 
mean duration of Thanksgiving dinner was 257 min (SD = 162 min). The average probability of voting mismatch was 0.44 (SD = 0.10). Standard errors are 
reported in parentheses and clustered at the precinct-cross-precinct level. R², coefficient of determination. ***P < 0.001; **P < 0.01. 


[Table 1 body garbled in extraction. Row labels: probability of political mismatch; number of fixed-effects groups. Surviving standard errors for the mismatch coefficient: (2.226), (2.952), (8.696), (14.55).]

2016 two-party vote share of their home pre- 
cinct. A resident of a precinct that recorded 150 
Clinton and 50 Trump votes, for example, would 
be assigned 0.75 Clinton and 0.25 Trump votes. 
Figure 1C compares these votes against actual 
2016 two-party vote shares for each state and 
Washington, D.C. The 45° line represents where 
states would lie if the SafeGraph sample politi- 
cally matched the distribution of American voters. 
Our imputed votes are accurate to within 1 per- 
centage point in 33 states and within 5 percentage 
points in all states. Nationally, the data suggest a 
two-party Democratic vote share of 0.516, com- 
pared to the actual share of 0.511. 
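
The imputation described above can be illustrated with a short sketch. The precinct identifiers and vote counts are hypothetical, and `imputed_share` is an illustrative helper rather than anything named in the study; the only element taken from the text is that each resident receives a fractional vote equal to the two-party share of the home precinct.

```python
def imputed_share(home_precincts, precinct_votes):
    """Average Democratic two-party share over sampled users.

    home_precincts: one precinct id per sampled user.
    precinct_votes: precinct id -> (democratic votes, republican votes).
    """
    total = 0.0
    for precinct in home_precincts:
        dem, rep = precinct_votes[precinct]
        total += dem / (dem + rep)  # fractional Democratic vote for this user
    return total / len(home_precincts)

# Hypothetical data: precinct "A" matches the 150-to-50 example in the text.
precinct_votes = {"A": (150, 50), "B": (40, 60)}
users = ["A", "A", "B"]

share = imputed_share(users, precinct_votes)  # (0.75 + 0.75 + 0.40) / 3
```

Comparing such an aggregate with the actual state-level share is the check plotted in Fig. 1C.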



We first examine whether, conditional on 
traveling for Thanksgiving dinner, the partisan 
distance between a home and destination affects 
that dinner’s duration. We restrict our sample 
to residents who were home both in the morn- 
ing and during the night of Thanksgiving, but 
who traveled for Thanksgiving dinner, to focus 
Table 2. Asymmetric effects of political mismatch. Each regression (column) estimates the 
effect of voting disagreement between travelers and hosts [DPR traveler to RPR host (DPR→RPR) 
and RPR traveler to DPR host (RPR→DPR)] on 2016 Thanksgiving dinner duration. The F-test P value 
tests for equality between coefficients, to test for an asymmetric mismatch effect. The average 
probability of DPRs attending a RPR-hosted dinner and vice versa was 0.221 and 0.215, respectively 
(for both, SD = 0.10). Standard errors are reported in parentheses and clustered at the precinct- 
cross-precinct level. ***P < 0.001; **P < 0.01; *P < 0.05. 


[Table 2 body garbled in extraction. Dependent variable: duration of Thanksgiving dinner (min), regressions 1 to 4. Row labels: probability DPR→RPR; probability RPR→DPR; observations; R²; fixed effects; number of fixed-effects groups. Surviving entries, not reliably assignable to cells: 642,962; 642,962; 35,507; 302,716; 414,950.]



Table 3. Political advertising heightens partisan effects. Each regression (column) estimates 
the effect of voting disagreement between travelers and hosts on 2016 Thanksgiving dinner 
duration. The second and fourth regressions explore whether political advertising heightens these 
effects. Media markets in swing states like Florida saw more than 26,000 ads in 2016. Standard 
errors are reported in parentheses and clustered at the precinct-cross-precinct level. A blank cell 
indicates that the variable was not included in this regression. ***P < 0.001; *P < 0.05. 


[Table 3 body garbled in extraction. Dependent variable: duration of Thanksgiving dinner (min), regressions 1 to 4. Row labels: probability of political mismatch; probability DPR→RPR; number of political ads (1000 ads per market); probability of political mismatch times number of political ads; probability DPR→RPR times number of political ads; probability RPR→DPR times number of political ads. Surviving entries, not reliably assignable to cells: -14.40***; standard errors (2.454), (2.879), (2.555), (2.978), (0.185), (0.185), (0.393), (0.417), (0.439); 642,962 observations in three surviving columns.]

our analysis on travelers who could control the 
duration of their visits. In Table 1, we estimate 
the following equation: 

$$\mathrm{duration}_{ij} = \alpha + \beta\,\mathrm{mismatch}_{ij} + \gamma F_{IJ} + \epsilon_{ij}$$

where 

$$\mathrm{mismatch}_{ij} = P_i(1 - P_j) + (1 - P_i)P_j$$

In this specification, $\mathrm{duration}_{ij}$ is the number of 
minutes traveler $i$ spent with host $j$ on Thanksgiving, 
$F_{IJ}$ is a set of fixed effects that form 
groups of people defined by pairs of home 
($I$) and destination ($J$) locations, and $\beta$ is the 
coefficient of interest. $P_i$ and $P_j$ are the two-party 
vote shares associated with home precincts for 
$i$ and $j$, where $P_i = \mathrm{democratic}_i/(\mathrm{democratic}_i + \mathrm{republican}_i)$. 
By using $P_i$ and $P_j$, $\mathrm{mismatch}_{ij}$ is 
the imputed probability that persons $i$ and $j$ 
voted for different candidates in 2016. In all 
tables, regressions control for progressively finer 
($I$, $J$) location pairs, culminating in five-digit 
geohash (geohash-5) boxes, a global grid of rectangular 
areas, each about 3 miles by 3 miles 
in size. 
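
As a worked illustration of the mismatch variable defined above, the following sketch computes the imputed probability that a traveler and a host voted differently, given the Democratic two-party vote shares of their home precincts. The function name and the example shares are illustrative assumptions; the formula is the one stated in the text.

```python
def mismatch_probability(p_i, p_j):
    """Imputed probability that persons i and j voted for different
    candidates, given Democratic two-party shares p_i and p_j."""
    dem_visits_rep = p_i * (1 - p_j)   # i voted Democratic, j Republican
    rep_visits_dem = (1 - p_i) * p_j   # i voted Republican, j Democratic
    return dem_visits_rep + rep_visits_dem

# Hypothetical traveler from a 75% Democratic precinct visiting a host
# in a 25% Democratic precinct.
example = mismatch_probability(0.75, 0.25)  # 0.5625 + 0.0625 = 0.625
```

Two precincts that are each split evenly give a value of 0.5, and the value approaches 1 only when the two precincts are nearly unanimous in opposite directions.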

To control for confounds including demo- 
graphics, distance, and travel time, our regres- 
sions compare Thanksgiving dinner durations 
between travelers with the same home and des- 
tination areas. For example, regression 3 com- 
pares two travelers if and only if they both live 
in ZIP code X and visit ZIP code Y. The coefficient 
of interest $\beta$ measures the reduction 
in Thanksgiving dinner duration between travelers 
within the same $F_{IJ}$ comparison groups but 
who likely voted differently than their Thanksgiv- 
ing hosts. Standard errors are clustered at the 
home precinct-cross-destination precinct level. 
We use progressively tighter spatial controls to 
control for both demographics and travel dis- 
tance simultaneously. 
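
The comparison within home-destination groups can be sketched with a simple within-group (fixed-effects) estimator: subtract each group's mean from both variables, then fit a slope to the demeaned data. This is a minimal sketch on made-up numbers, assuming a single regressor and no clustering of standard errors; it is not the authors' estimation code.

```python
from collections import defaultdict

def within_group_slope(rows):
    """OLS slope of y on x after removing group means.

    rows: iterable of (group, x, y), where group identifies a
    home-destination pair, x is mismatch, and y is duration in minutes.
    """
    groups = defaultdict(list)
    for group, x, y in rows:
        groups[group].append((x, y))
    sxy = 0.0
    sxx = 0.0
    for obs in groups.values():
        mean_x = sum(x for x, _ in obs) / len(obs)
        mean_y = sum(y for _, y in obs) / len(obs)
        for x, y in obs:
            sxy += (x - mean_x) * (y - mean_y)
            sxx += (x - mean_x) ** 2
    return sxy / sxx

# Hypothetical data: durations fall by 40 min per unit of mismatch within
# each group, while the groups differ in their baseline duration.
rows = [
    (("X", "Y"), 0.3, 288.0),
    (("X", "Y"), 0.5, 280.0),
    (("X", "Z"), 0.4, 184.0),
    (("X", "Z"), 0.6, 176.0),
]
beta = within_group_slope(rows)
```

Because group means are removed, the large baseline difference between the two groups does not contaminate the slope, which is the point of comparing only travelers who share the same home and destination areas.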

The results in Table 1 indicate that families 
that were likely to have voted for different pres- 
idential candidates spent about 30 to 50 fewer 
minutes together—subtracted from an average 
Thanksgiving dinner time of 4.2 hours—after 
controlling for both travel distance and location- 
correlated demographics. As we add finer spatial 
controls, our estimate of $\beta$ remains fairly 
stable, with a point estimate of 56.3 ± 14.6 min 
under our tightest geohash-5 controls. In table 
S4, we report qualitatively identical results when 
demographics such as race, age, education, income, 
and employment are controlled for separately. 

We examine the two components of $\mathrm{mismatch}_{ij}$, 
$P_i(1 - P_j)$ and $(1 - P_i)P_j$, to separately measure the 
effect of voting disagreement among Democratic- 
precinct residents (DPRs) visiting Republican- 
precinct residents (RPRs) and vice versa. Table 2 
demonstrates that, conditional on traveling, DPRs 
shortened their visits to RPR hosts by about 
20 to 40 min, whereas RPRs shortened their 
visits to DPRs by about 50 to 70 min. F-test re- 
sults indicate that these estimates are statistically 
different (P < 0.0001 in four of five specifica- 
tions), with RPRs shortening their cross-party 
stays by more minutes than DPRs. 

When investigating whether these effects in- 
teract with local political advertising, we find 
that cross-partisan Thanksgiving dinners are 
further shortened by around 2.6 min on average 
for every 1000 political advertisements aired in 
the traveler's home media market (Table 3). Some 
media markets in swing states saw more than 
26,000 ads over the course of the campaign, 
implying a 69-min-shorter Thanksgiving dinner 
for vote-mismatched families in Orlando, for ex- 
ample, compared to those in markets without 
advertising. Although this effect may not be solely 
due to advertising, which may be correlated with 
other campaign activities such as rallies, cam- 
paign visits, and fundraising efforts, these re- 
sults bolster the conclusion that measured effects 
on Thanksgiving dinner duration likely stem 
from an increased intensity and salience of par- 
tisan differences. 

The results in table S1 support this finding 
and report the results of a placebo test con- 
cerning whether advertisements in 2016 affected 
Thanksgiving dinner behavior the year before 
airing. Regardless of whether we pool smartphone 
users or split the sample into DPRs and RPRs, we 
find no evidence of preexisting partisan effects 
in regions that witnessed high advertising levels. 
Although our empirical results estimate briefer 
Thanksgiving dinners among cross-partisan gath- 
erings in both years, the ad-related amplification 
of this effect is present only in 2016, both in 
statistical significance and magnitude, support- 
ing our conjecture that the main effect is most 
likely political in nature. 

An examination of destination choices sug- 
gests that travelers did not change plans to 
reduce political divisions from 2015 to 2016. 
Among travelers who traveled in both years— 
the strongest possible control for demographic 
and spatial confounds—we observe no appre- 
ciable difference in the distribution of likely 
political mismatch (fig. S1). This finding sug- 
gests that travelers were more likely to change 
the duration of Thanksgiving gatherings than to 
change the destination. 

Finally, tables S2 and S3 estimate linear 
probability models for the choice of whether 
to travel for Thanksgiving, in both 2015 and 
2016. When matched residents living within 
1.5 miles of each other are compared, DPRs 
reduced their likelihood of travel between 2015 
and 2016 by 2 percentage points more than com- 
parable RPRs, an effect that increases substantially 
in areas with heavy political advertising. 

Examining only those residents included in 
both the 2015 and 2016 data yields qualitatively 
similar results. Among residents at home on 
Thanksgiving morning in both years, 56.4% 
traveled for Thanksgiving in 2015, whereas 51.9% 
traveled in 2016 (n = 28,890; Fisher’s exact test, 
P < 0.0005). Accompanying this difference is a 
reduction in Thanksgiving dinner duration for 
those cross-partisan dinners that still occurred. 
By comparing travelers who went to the same 
location both years (n = 1271), we estimate that 
politically mismatched gatherings declined by 
42.1 ± 41.4 min. Although this small sample size 
precludes statistical significance, this estimate is 
very close to our findings shown in Table 1. 
By aggregating across the 77% of American 
adults who own smartphones (17), our results 
suggest that partisan differences cost Americans 
73.6 million hours of Thanksgiving time with 
others in 2016, 47.8% from DPRs and 52.2% 
from RPRs. Political advertising-related parti- 
sanship comprised 15.9 million of lost person- 
hours, 46.3% from DPRs and 53.7% from RPRs. 
Altogether, an estimated 33.9 million person- 
hours of cross-partisan discourse were elimina- 
ted, perhaps creating a feedback mechanism 
by which partisan segregation reduces oppor- 
tunities for close cross-party conversations. 

Our findings have several implications, both 
for the literature and for campaign policy. After 
the 2016 election, anecdotal media reports and 
online social-media behavior (18) demonstrated 
an avoidance of personal confrontations over 
political issues among Democratic voters, find- 
ings our study corroborates. RPRs, however, 
were more sensitive to partisan differences at 
Thanksgiving dinners, an effect that supports 
findings of greater partisan-selective exposure 
among Republicans in news-media consumption 
(19). Our results suggest that partisan polarization 
extends in quantitatively meaningful ways 
to close family settings and that political adver- 
tising and related campaign efforts can exacer- 
bate these fissures. As abbreviated Thanksgiving 
gatherings tend to accumulate in regions with 
greater campaign activity, policies designed to 
shorten campaigns may reduce the private costs 
of political polarization. 
Ancient human parallel lineages 
within North America contributed 
to a coastal expansion 
Little is known regarding the first people to enter the Americas and their genetic legacy. 
Genomic analysis of the oldest human remains from the Americas showed a direct 
relationship between a Clovis-related ancestral population and all modern Central and 
South Americans as well as a deep split separating them from North Americans in Canada. 
We present 91 ancient human genomes from California and Southwestern Ontario and 
demonstrate the existence of two distinct ancestries in North America, which possibly split 
south of the ice sheets. A contribution from both of these ancestral populations is found in 
all modern Central and South Americans. The proportions of these two ancestries in 
ancient and modern populations are consistent with a coastal dispersal and multiple 


admixture events. 


An increasing body of archaeological (1-3) 
evidence shows that the initial peopling 
of the Americas occurred at least a few 
thousand years prior to the spread of the 
Clovis cultural complex ~13,000 years ago 
(all dates are calibrated) (4), with a majority 
of well-supported Pre-Clovis sites clustered in 
coastal areas and around glacial edges (1, 3, 5). 
Studies of ancient and modern genomes have 
uncovered four distinct ancestry components 
within the Americas arriving in three hypothe- 
sized waves: the most recent Thule-related Neo- 
Eskimo ~2000 years ago, the Saqqaq/Dorset 
Paleo-Eskimo ~4500 years ago (both restricted 
to the Arctic region), and a “First American” dis- 
persal prior to 13,000 years ago that split within 
North America into a northern and a southern 
branch (6-10). The northern branch is ancestral 
to populations including Algonquian, Na-Dené, 
Salishan, and Tsimshian speakers from Canada 
(NAM), whereas the southern branch includes 
the ancestors of the Clovis individual (Anzick-1) 


and all Mexicans, Central Americans (CAM), and 
South Americans (SAM) (9-12). Within the south- 
ern branch there is some localized evidence of 
early population structure, as a few modern 
Amazonian populations show an excess genet- 
ic affinity to Australasians (13, 14). The second 
oldest North American genome, The Ancient One 
(Kennewick Man, 8700 to 8400 years ago), is 
likely to have derived from the ancestral north- 
ern branch but is a poor proxy for modeling this 
ancestry, given its low sequencing depth (10). 
Here, we investigated the ancestral relation- 
ship between the northern (NAM) and southern 
(Mexico, CAM, and SAM) branch populations. To 
do so, we sequenced 91 ancient whole genomes 
from North America, mainly from two geograph- 
ic areas: the California Channel Islands in the west 
and Southwestern Ontario in the east, near mod- 
ern Algonquian-speaking populations (Fig. 1A 
and table S1) (15). Both of these areas show ev- 
idence of occupation from at least 13,000 years 
ago (5, 16) and are geographically located south 


of the known distribution of the ancient Neo- and 
Paleo-Eskimo dispersals (6). We radiocarbon- 
dated 27 individuals (table S2) (15) to between 
~4800 and ~200 years ago and sequenced all 
genomes to an average depth of 0.007 to 13.6x 
(tables S1, S3, and S4) (15). Mitochondrial DNA 
(mtDNA) haplotypes were recovered from all 
samples (tables S1 and S5) (15) and Y chromosome 
haplotypes from 34 of the male individuals (fig. 
S1 and data S1) (15). In addition, a set of mod- 
ern whole mitochondrial genomes (n = 45) 
were resequenced from a previous study to ex- 
plore sex-specific migration patterns on the west 
coast of Southern California (tables S3 and S5) 
(15, 17). 

All ancient Native American individuals 
clustered with modern Native Americans on a 
worldwide principal components analysis (PCA) 
(Fig. 1B) (15, 18). In a regional plot that includes 
Siberians, the northern and southern branches 
show clear distinction (Fig. 1B, inset). Ancient 
Californians cluster alongside southern branch 
populations near to Anzick-1, whereas the an- 
cient Southwestern Ontario (ASO) population 
clusters with modern Algonquian speaking pop- 
ulations and The Ancient One (Fig. 1B). Modern 
and ancient Athabaskans map between ASO 
and Northeast Siberian populations (Chukchi/ 
Koryak) (Fig. 1B). The Upper Sun River individ- 
uals (USR1 and USR2) map near to Shuka Kaa, 
between Central Siberians and the northern 
branch populations (Fig. 1B). 


the ancient genomes within the context of a 
worldwide panel (Fig. 1, B and C, fig. S2, and 
data S2) (15) indicate that, relative to the ASO, 
extant NAM as well as ancient Pacific Northwest 
Coast (PNWC) and ancient Northern Athabas- 
kans have up to 50% more Arctic-related ancestry, 
prominent in Greenland Inuits and also found in 
Siberian Eskimos (Fig. 1C) (5). These differences 
are further confirmed by significant Z (Z > 3) 
scores for D-statistics D(Mbuti,SiberianPop;Mixe, 
TestPop) (20) and f3-statistics f3(ASO,TestPop; 
Mbuti) (21, 22) (table S6 and data S3) (15). We 
found no significant evidence of gene flow into 
the ASO from any non-American population 
(table S7 and data S3) (15). 

All ancient Californian genomes clustered to- 
gether with southern branch (Anzick-1, Mexico, 
CAM, SAM) populations on the regional PCA 
(Fig. 1B) (15). As suggested by the archaeological 
and osteological evidence (23-25), radiocarbon- 
dated individuals from San Nicolas Island from 
early (ESN) and late (LSN) periods (Fig. 1D) 
clustered into two distinct genetic populations 
in both the autosomal and uniparental markers 
with evidence of 11% autosomal continuity between 
the early and late populations (Fig. 1E) (15). 
The genome-wide diversity estimates (26) of the 
earliest-dated LSN individuals are greater than 
those of the ESN individuals (fig. S3) (15), which 
is consistent with higher effective population 
sizes in the LSN source population and/or recent 
admixture. Outgroup f3(Test,X;Mbuti) and 
D(Mbuti,Test;Mixe,X) tests show that the ESN 
shares more drift with SAM (Z > 3) than with 
geographically proximal populations (data S3) 
(15), which suggests that the ESN is related to a 
population that expanded into South America. 
Fig. 1. Ancient individuals, population genetic analyses and modeling. 
(A) Sites of newly sequenced ancient individuals are designated by 
colored triangles. Comparative modern populations and ancient individuals 
are designated by black circles and triangles, respectively. (B) PCA with ancient 
individuals projected onto modern Native American and Siberian variation. 
Inset: Ancient genomes projected onto modern worldwide data. (C) Visualiza- 
tion of model-based ancestry analysis at fivefold cross-validation—supported 
appropriate (15). (E) A model that explains genetic diversity in the Late 
Southern Channel Island populations through three-way admixture. 
By contrast, the LSN population shares more 
drift and alleles with geographically proximal 
populations (Z > 3) (data S3) (15). 
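
The outgroup f3 tests mentioned above measure shared genetic drift between two populations relative to an outgroup. A minimal sketch of the statistic follows, using the textbook form (the average over SNPs of the product of allele-frequency differences from the outgroup) without the finite-sample bias correction or the block-jackknife Z scores used in practice. The allele frequencies are invented for illustration and the function name is an assumption.

```python
def outgroup_f3(test, x, outgroup):
    """Naive outgroup f3(Test, X; Outgroup): mean over SNPs of
    (o - t) * (o - x), where t, x, o are allele frequencies.
    Larger values indicate more drift shared by Test and X."""
    products = [(o - t) * (o - xf) for t, xf, o in zip(test, x, outgroup)]
    return sum(products) / len(products)

# Hypothetical allele frequencies at three SNPs.
test_pop = [0.9, 0.2, 0.7]
close_pop = [0.8, 0.3, 0.6]   # frequencies near those of test_pop
far_pop = [0.5, 0.5, 0.4]     # frequencies nearer the outgroup
outgroup = [0.1, 0.6, 0.2]

f3_close = outgroup_f3(test_pop, close_pop, outgroup)
f3_far = outgroup_f3(test_pop, far_pop, outgroup)
```

With these toy values the closer population yields the larger statistic, mirroring how the text reads a higher outgroup f3 as more shared drift.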

We modeled the population history of the 
Americas using qpGraph (15, 27) and found that 
the ASO and Mexican (Pima) populations were 
consistently outgroups to sets of clades formed 
by Anzick-1, SAM (Surui), and ESN populations in 
analyses that did not involve admixture (fig. S4) 
(15, 21). Fit between the data and the tree could 
be significantly improved when modeling ancient 
Californian, modern Pima, and Surui populations 
through admixture of two basal ancestries that 
we call ANC-A and ANC-B (Fig. 2A) (15). The ESN, 
Northern Channel Islands and Santa Barbara 
(NCI/SB), and Surui populations share similar 
proportions of both components, while the Pima 
have a higher ANC-B component (Fig. 2A) (15). 
We used qpGraph to estimate the ANC-B con- 
tribution in modern CAM and SAM populations 
and found it to vary within a range of 42 to 71% 
(average 53%; table S8) (15). In SAM populations, 
the lower end of the spectrum of contributions 
of ANC-B are found in the Amazonian Equatorial 
Tucanoan-speaking groups (including Surui) 
(40 to 53%) and the highest in the Andeans (50 
to 71%) (Fig. 2B and table S8) (15), particularly in 
the Chilote and Huilliche (~70%) from locations 
overlapping the Monte Verde site (~18,500 to 
14,500 years ago) (Fig. 2B). 

The clear separation of ANC-A and ANC-B an- 
cestries is further supported by the sharing of 
unambiguous, derived haplotype segments in 
modern Surui and Pima populations (27) with 
both the ASO (CK-13) and Anzick-1 individuals 
(fig. S5) (15). The results of this analysis are con- 
sistent with ancient substructure and a separa- 
tion of at least a few thousand years between the 
ANC-A and ANC-B populations prior to merging 
(fig. S6) (15). The summary of evidence presented 
here allows us to reject models of a panmictic 
“first wave” population from which the ASO 
Fig. 2. Visual model of ancestry components and distribution of proportions in the Americas. (A) A model with four admixture events that offers a 
good fit to the data (Z = 0.888) (15). (B) Scale of ANC-B ancestry from 0% in Anzick-1 to 100% in the ASO and modern Algonquian-speaking populations. 
Fig. 3. Dispersal models that are consistent with the results of this study. Red and blue indicate ANC-A and ANC-B, respectively; symbols denote 
admixture event(s). Locations of admixture events are hypothetical. (A) A model with one admixture event in North America. (B) A model in which 
an ANC-B population first reached South America, followed by an ANC-A population with multiple admixture events. (C) The same model as (B), but 
reversing the populations. (D) A model with multiple admixture events and dispersals. 
diverged after the peopling of South America or 
in which solely the ANC-A population contributed 
to modern southern branch populations. Be- 
cause populations vary in ANC-A and ANC-B pro- 
portions but do not differ significantly in their 
affinity to non-American populations (table S7) 
(15), it is possible that ANC-A and ANC-B split 
within America as opposed to Beringia where 
there would have been ongoing gene flow with 
Siberia. 

Four possible models can explain the contri- 
bution of both branches to CAM and SAM popu- 
lations: (i) an admixture event in North America 
prior to the peopling of South America (Fig. 3A); 
(ii) ANC-B-related ancestral population(s) dis- 
persing into South America first, followed by a 
dispersal of ANC-A-related population(s) and ad- 
mixture of the two branches occurring in South 
America (Fig. 3B); (iii) ANC-A-related ancestral 
population(s) dispersing into South America first, 
followed by a dispersal of ANC-B-related pop- 
ulation(s) and admixture of the two branches 
occurring in South America (Fig. 3C); and (iv) 
multiple admixture events occurring in North 
America, with multiple dispersals into South 
America (Fig. 3D). Additional ancient DNA from 
terminal Pleistocene human remains within the 
Americas is needed to determine which model 
best describes the sequence of events con- 
stituting the complex population history of 
the Americas. 
Ancient genomes from Iceland reveal 
the making of a human population 
Opportunities to directly study the founding of a human population and its subsequent 
evolutionary history are rare. Using genome sequence data from 27 ancient Icelanders, we 
demonstrate that they are a combination of Norse, Gaelic, and admixed individuals. We 
further show that these ancient Icelanders are markedly more similar to their source 
populations in Scandinavia and the British-Irish Isles than to contemporary Icelanders, who 
have been shaped by 1100 years of extensive genetic drift. Finally, we report evidence of 
unequal contributions from the ancient founders to the contemporary Icelandic gene pool. 
These results provide detailed insights into the making of a human population that has 
proven extraordinarily useful for the discovery of genotype-phenotype associations. 


Historical sources (1) indicate that Iceland 
was settled by people from Norway and 
the British-Irish Isles between 870 and 
930 CE (2, 3). Throughout the preceding 
century, the Norse had raided, traded, 
and settled in the islands and coastal regions 
of Ireland, Scotland, and northern England (4). 
Settlers in Iceland are thought to number ~8,000 
to 16,000 individuals, with the population rare- 
ly exceeding 50,000 until 1850 (3), after which 
there was a rapid expansion to its current size 
of 330,000. 

Studies of mitochondrial DNA (mtDNA) and 
Y-chromosomes from contemporary Icelanders 
indicate that 62% of their matrilineal ancestry 
stems from Scotland and Ireland and 75% of 
their patrilineal ancestry is Scandinavian (5-7). 
Moreover, mtDNA control region sequences from 
68 Icelandic skeletal remains (~1000 years old) 
are more similar to contemporary source pop- 


ulations in Scandinavia, Scotland, and Ireland 
than to contemporary Icelanders (8). This was 
explained by a loss of mtDNA haplotypes from 
the Icelandic gene pool due to extensive genetic 
drift (8), supported by findings from whole- 
genome sequencing (WGS), genome-wide asso- 
ciation studies (9), and population genetics 
analyses (10). 

To directly assess the formation and subse- 
quent evolution of a human population, we 
selected 35 ancient Icelanders for WGS (Fig. 1), 
most of whom represent the first generations of 
settlement (11) (Table 1 and tables S1 and S2). 
The 27 individuals passing quality control (11) 
(tables S3 and S4) were sequenced to between 
0.18× and 30.7× average read depth (median = 
0.71×) (Table 1). The data display characteristics 
of ancient DNA (aDNA), and their authenticity 
is supported by multiple analyses (11) (figs. S1 
and S2 and tables S4 to S6). 

The sex of 26 individuals was confidently 
determined by calculating the proportion of 
sex chromosome reads that mapped to the Y- 
chromosome ($R_Y$) (12) (fig. S3 and table S7). The 
result for one individual, YGS-B2, was ambiguous, 
as $R_Y$ = 0.055 exceeded the upper limit 
for females (0.016) but was below the lower limit 
for males (0.075). The read depth of sex chromo- 
somes relative to autosomes indicates that YGS- 
B2, who died before adulthood, carried two 
X-chromosomes and one Y-chromosome (fig. S4). 
This is further supported by a clearly assigned 
Y-chromosome haplogroup and X-chromosome 
heterozygosity in the range observed for females 
(table S8). To our knowledge, Klinefelter syndrome 
has not been reported in aDNA studies to date. Of 
the 24 individuals from pre-Christian (<1000 CE) 
burials (Table 1), 19 (79%) were male (χ² test 
against expectation of 50%, P = 0.008). As the 
Fig. 1. A map of Iceland showing the locations of skeletal remains sampled for this study. 
Circles indicate females and squares, males. Eight samples (light gray labels) were ultimately 
excluded from further analysis. 
individuals were not selected by morphologically 
predicted sex, this implies sex differences in burial 
practices during the period (13), such that female 
burials were either rarer or less likely to be dis- 
covered in our time. 
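
The sex assignment and the sex-ratio test in this paragraph can be reproduced with a short sketch. The thresholds 0.016 and 0.075 and the counts 19 of 24 come from the text; the read counts in the example are invented, the function names are illustrative, and the use of a continuity correction in the χ² test is an assumption (it gives a P value close to the reported 0.008, whereas the uncorrected test gives a smaller one).

```python
import math

FEMALE_MAX = 0.016  # upper R_Y limit for females, as stated in the text
MALE_MIN = 0.075    # lower R_Y limit for males, as stated in the text

def classify_sex(y_reads, x_reads):
    """Return (R_Y, label), where R_Y is the share of sex-chromosome
    reads that map to the Y chromosome."""
    r_y = y_reads / (x_reads + y_reads)
    if r_y <= FEMALE_MAX:
        return r_y, "female"
    if r_y >= MALE_MIN:
        return r_y, "male"
    return r_y, "ambiguous"

def chi_square_p(count_a, count_b, continuity=True):
    """P value of a one-degree-of-freedom chi-square test of two counts
    against an expected 50:50 split."""
    expected = (count_a + count_b) / 2
    adjust = 0.5 if continuity else 0.0
    stat = sum((abs(c - expected) - adjust) ** 2 / expected
               for c in (count_a, count_b))
    # Survival function of chi-square with 1 df: erfc(sqrt(stat / 2)).
    return math.erfc(math.sqrt(stat / 2))

# Hypothetical read counts giving R_Y = 0.055, the value reported for YGS-B2.
r_y, label = classify_sex(y_reads=55, x_reads=945)

# 19 males and 5 females among the 24 pre-Christian burials.
p_value = chi_square_p(19, 5)
```

An intermediate R_Y alone does not identify the karyotype; the text relies on read depth, a Y-chromosome haplogroup, and X-chromosome heterozygosity for that conclusion.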

Little can be inferred about the phenotypes 
of the ancient Icelanders. However, we report 
haploid genotypes for a set of rare disease- 
associated sequence variants discovered in the 
Icelandic population and variants thought to have 
been subject to positive selection in Europeans 
(tables S9 and S10). 

We next examined the relationship between the ancient Icelanders and contemporary populations using single-nucleotide polymorphism (SNP) arrays (table S11). As the read depth for most ancient Icelanders was too low to call diploid genotypes, haploid genotypes were used in all analyses (11). We performed a principal components analysis (PCA) of 2139 contemporary individuals from 28 European populations. When projected onto the first two principal components (PCs), all ancient Icelanders plot within clusters of contemporary Scandinavians and British-Irish Islanders (fig. S5).

We also performed a PCA restricted to con- 
temporary individuals from Scandinavia (n = 
3118), the British-Irish Isles (n = 1436), and 
Iceland (n = 916) based on 404,066 SNPs. We 
observed a separation between contemporary 
Scandinavians and British-Irish Islanders on PC2 
(Fig. 2A), reflecting allele frequency differences 
that have accumulated across several millennia 
of drift. PC1 reveals an even more marked sep- 
aration of contemporary Icelanders from their 
source populations. This divergence is also seen 
in the distribution of genetic distances (F_ST)
between Icelanders and their source populations 
(fig. S6A). 
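As a rough illustration of how a genetic distance of this kind can be computed from allele frequencies, the sketch below uses Hudson's F_ST estimator, combined across SNPs as a ratio of sums. This is an assumption: the source does not say which estimator was used. The function name and the example frequencies are hypothetical.

```python
def hudson_fst(freqs1, freqs2, n1, n2):
    """Hudson's F_ST from per-SNP allele frequencies in two populations.

    freqs1, freqs2: allele frequencies at the same SNPs (equal-length lists).
    n1, n2: number of sampled chromosomes in each population.
    Numerators and denominators are summed over SNPs before dividing.
    """
    num = 0.0
    den = 0.0
    for p1, p2 in zip(freqs1, freqs2):
        num += ((p1 - p2) ** 2
                - p1 * (1 - p1) / (n1 - 1)
                - p2 * (1 - p2) / (n2 - 1))
        den += p1 * (1 - p2) + p2 * (1 - p1)
    return num / den


# Hypothetical frequencies at four SNPs, for illustration only.
pop_a = [0.10, 0.45, 0.80, 0.30]
pop_b = [0.15, 0.40, 0.70, 0.35]
print(hudson_fst(pop_a, pop_b, n1=200, n2=200))
```

Populations with identical frequencies give values near zero, and populations fixed for different alleles give a value of one.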

The most likely cause for this divergence of Icelanders is 1100 years of genetic drift resulting from an initial founder event and subsequent small population size (8-10, 14). This interpretation is supported by PCA projections of 25 ancient Icelanders (24 pre-Christian and one early-Christian, <1104 CE), all of whom plot within the range of PC1 occupied by contemporary Scandinavians and British-Irish Islanders (Fig. 2A and fig. S7). Accordingly, F_ST distances show that allele frequencies in ancient Icelanders are more similar to those of the source populations than to those of contemporary Icelanders (fig. S6B). Furthermore, ancient Icelanders show no evidence of the geographic patterns of genetic variation found in contemporary Icelanders (fig. S8). The implication that source populations experienced much less drift than Icelanders (8) is supported by PCA projections of ancient individuals (n = 16) from the British-Irish Isles dating from ~4000 to 1100 calibrated years before the present (17) (table S12), who cluster within contemporary British and Irish populations (fig. S9).

Table 1. Summary of genomic sequence data from ancient Icelanders.

Time period               Classification based on 87Sr/86Sr values   Sample    Archaeological date
Pre-Christian             Migrant                                    DAV-A8    <1000
                          Not available                              FSS-A1    <1000
13th century and later    Not available                              FOV-A1    >1000

14C dating (cal CE)   Auto. genome depth of coverage (x)   Auto. genome covered (%)   SNP overlap with European ref. (%)†   SNP overlap with North European ref. (%)‡
<1050                 Broil                                89.12                      90.14                                 93.92
NA                    Ory                                  18.37                      14.99                                 15.67
1120                  One                                  48.41                      44.69                                 46.45
1246-1302             0.68                                 43.26                      37.24                                 38.91
NA                    0.57                                 36.05                      34.03                                 35.42

Sex chr. karyotype   mtDNA haplogroup   chrY haplogroup      87Sr/86Sr ratio
XX                   H1                 -                    0.7121
XXY                  J1c1a              R1b1a1a2a1a          NA
XY                   J1b1a1a            R1a1                 0.7061
XY                   HV17a              R1b1a1a2a1a2c1a1     NA
XY                   H1                 R1b1a1a2a1a          NA

*Samples marked with an asterisk symbol (*) were excavated from the same site as samples that have been subjected to radiocarbon dating, yielding the date estimate 980 to 1020 cal CE. †N = 227,056 SNPs. ‡N = 404,066 SNPs. §NA, not available.

Ebenesersdottir et al., Science 360, 1028-1032 (2018)    1 June 2018

Whereas some ancient Icelanders appear to be unadmixed Scandinavian or British-Irish Islanders, others resemble a mixture of these ancestral groups (5-8) (Fig. 2A). Contemporary genotyped individuals were grouped into Norse (n = 2138, Norway and Sweden) and Gaelic (n = 459, defined here as Ireland and Scotland, excluding Orkney), and the relationships of the ancient Icelanders (X) to these groups were tested with D-statistics (15) in the form D(Yoruba, X; Norse, Gaelic) (Fig. 2B and table S13).
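A D-statistic of the form D(W, X; Y, Z) can be sketched from per-SNP allele frequencies as below. This follows the commonly used frequency-based definition and is only an illustration: the authors' software, SNP filters, and standard-error estimation are not described here, sign conventions differ between tools, and the example frequencies are hypothetical.

```python
def d_statistic(w, x, y, z):
    """D(W, X; Y, Z) from per-SNP allele frequencies (equal-length lists).

    In this sketch's convention, a positive value means X shares more
    alleles with Z than with Y, relative to the outgroup W.
    """
    num = 0.0
    den = 0.0
    for pw, px, py, pz in zip(w, x, y, z):
        num += (pw - px) * (py - pz)
        den += (pw + px - 2 * pw * px) * (py + pz - 2 * py * pz)
    return num / den


# Hypothetical frequencies at two SNPs. X would be a haploid ancient genotype
# (0 or 1 at each SNP) in practice; fractional values are used here for brevity.
yoruba = [0.1, 0.5]
ancient = [0.3, 0.5]
norse = [0.2, 0.4]
gaelic = [0.6, 0.4]
print(d_statistic(yoruba, ancient, norse, gaelic))
```

Swapping the last two populations flips the sign, which is why the order Norse, Gaelic matters when reading Fig. 2B.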

Fig. 2. Ancestry of ancient Icelanders. (A) Ancient Icelanders projected onto the first two eigenvectors of a PCA based on contemporary Scandinavians, British-Irish Islanders, and Icelanders. Proportion of variance explained is shown in parentheses. Symbols for ancient individuals are as specified in Fig. 1. (B) D-statistics reflecting the differential affinity of ancient Icelanders (X) to Norse and Gaelic reference populations, with the Yoruba from Nigeria (YRI) as an outgroup (n = 91). Symbols are shaded as in Fig. 1. (C) Estimated Norse and Gaelic ancestry proportions for ancient Icelanders using ADMIXTURE in supervised mode.

We ran ADMIXTURE in supervised mode separately for each ancient Icelander with the Norse and Gaelic reference populations, yielding results (Fig. 2C) consistent with both PC2 axis position in Fig. 2A and D-statistics (Fig. 2B) (Pearson's |r| > 0.98 between all three ancestry assessments). Table 1 shows that Y-chromosomes from the 22 ancient Icelandic males belong to haplogroups common in Norse (I1 and R1a) and in Gaels (R1b1) (1) (table S14). An association between I1/R1a status and autosomal Norse ancestry was observed in the 19 pre-Christian males (p = 0.02, one-sided t test, Fig. 2C), consistent with them dating to the first generations of settlement.
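The idea behind a supervised two-way ancestry estimate for a single haploid genotype can be sketched as a one-parameter maximum-likelihood problem. This is not the ADMIXTURE program or its algorithm. It is a simplified grid search under the assumption that each observed allele is drawn with probability q * p_norse + (1 - q) * p_gaelic, where q is the Norse fraction; all names and data are hypothetical.

```python
import math


def norse_fraction_mle(alleles, p_norse, p_gaelic, grid=100):
    """Grid-search the Norse ancestry fraction q that maximizes the likelihood.

    alleles: haploid genotypes per SNP (0, 1, or None when missing).
    p_norse, p_gaelic: reference frequencies of allele 1 at the same SNPs.
    """
    eps = 1e-9  # keeps log() finite when a reference frequency is 0 or 1
    best_q, best_ll = 0.0, -math.inf
    for i in range(grid + 1):
        q = i / grid
        ll = 0.0
        for a, pn, pg in zip(alleles, p_norse, p_gaelic):
            if a is None:
                continue  # skip SNPs with no read coverage
            p1 = min(max(q * pn + (1 - q) * pg, eps), 1 - eps)
            ll += math.log(p1 if a == 1 else 1 - p1)
        if ll > best_ll:
            best_q, best_ll = q, ll
    return best_q


# Hypothetical, fully diagnostic SNPs: allele 1 is fixed in the Norse reference
# and absent from the Gaelic reference, so the estimate is the share of 1s.
alleles = [1, 1, 1, 1, 1, 1, 1, 0, 0, 0]
print(norse_fraction_mle(alleles, [1.0] * 10, [0.0] * 10))
```

Real reference populations differ only slightly in allele frequency at most SNPs, which is why hundreds of thousands of SNPs are needed for a usable estimate.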

We estimated the mean Norse ancestry of the settlement population (24 pre-Christians and one early Christian) as 0.566 [95% confidence interval (CI) 0.431-0.702], with a nonsignificant difference between males (0.579) and females (0.521). Applying the same ADMIXTURE analysis to each of the 916 contemporary Icelanders, we obtained a mean Norse ancestry of 0.704 (95% CI 0.699-0.709). Although not statistically significant (t test, p = 0.058), this difference is suggestive. A similar difference in Norse ancestry was observed with a frequency-based weighted least-squares admixture estimator (16): 0.625 [mean squared error (MSE) = 0.083] versus 0.74 (MSE = 0.0037). Finally, the D-statistic test D(YRI, X; Gaelic, Norse) also revealed a greater affinity between Norse and contemporary Icelanders (0.0004, 95% CI 0.00008 to 0.00072) than between Norse and ancient Icelanders (-0.0002, 95% CI -0.00056 to 0.00015). This observation raises the possibility that reproductive success among the earliest Icelanders was stratified by ancestry, as genetic drift alone is unlikely to systematically alter ancestry at thousands of independent loci (fig. S10). We note that many settlers of Gaelic ancestry came to Iceland as slaves, whose survival and freedom to reproduce are likely to have been constrained (17). Some shift in ancestry must also be due to later immigration from Denmark, which maintained colonial control over Iceland from 1380 to 1944 (for example, in 1930 there were 745 Danes out of a total population of 108,629 in Iceland) (18).

For 20 of the 24 pre-Christian Icelanders, strontium isotopes 86 and 87 were measured from dental enamel (11), revealing whether they spent their first 6 years in Iceland (nonmigrants) or elsewhere (migrants). Three are deemed migrants on the basis of high 87Sr/86Sr ratios (>0.710). These likely first-generation settlers were unmixed; DAV-A8 and DAV-A9 (from the same site) were Norse, and SSG-A4 was Gaelic (Table 1 and Fig. 2C). SSG-A2 and SSG-A3 (from the same site as SSG-A4) have lower 87Sr/86Sr ratios, albeit too high for a childhood solely in Iceland. Notably, SSG-A3 is estimated to be an equal mix of Norse and Gaelic, indicating that some admixture occurred before arrival in Iceland, perhaps in Viking settlements in Scotland or Ireland.
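The migrant rule stated above reduces to a single threshold on the enamel ratio. The sketch below encodes only the >0.710 cutoff given in the text. No cutoff is given for ratios that are lower but still too high for an Icelandic childhood, so such values are not labeled as nonmigrants here. The function name is hypothetical.

```python
MIGRANT_CUTOFF = 0.710  # 87Sr/86Sr cutoff quoted in the text


def classify_childhood(sr_ratio):
    """Label an enamel 87Sr/86Sr ratio using the migrant cutoff only."""
    if sr_ratio is None:
        return "not available"  # isotopes were not measured for every individual
    if sr_ratio > MIGRANT_CUTOFF:
        return "migrant"
    return "not classed as migrant"


print(classify_childhood(0.7121))  # ratio listed for DAV-A8 in Table 1
print(classify_childhood(None))
```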

One intriguing implication of the extensive drift that has accumulated in the Icelandic gene pool is that DNA samples from earlier generations could be dated through the proportion of drift shared with the contemporary population. To test this hypothesis, we sequenced two Icelanders from the 13th (FOV-A1) and 17th (KOV-A2) centuries. Projection of their genotypes onto the PCA in Fig. 2A resulted in placement partway between contemporary Icelanders and their source populations, consistent with the genotypes of FOV-A1 and KOV-A2 being shaped by only part of the drift that characterizes modern Icelanders.

Fig. 3. Shared drift of ancient and contemporary Icelanders. (A) Scatterplot of D-statistics reflecting Iceland-specific drift. To aid interpretation, we included values for ancient British-Irish Islanders and a subset of contemporary individuals (who were correspondingly removed from the reference populations). (B) Estimated Norse, Gaelic, and Icelandic ancestry for ancient Icelanders using ADMIXTURE in supervised mode. (C) Ancient Icelanders projected onto a PCA of contemporary Scandinavians and British-Irish Islanders. Black squares denote the five individuals who may have contributed disproportionately to the Icelandic gene pool.

A scatterplot of two D-statistics, D(YRI, X; Norse, Icelandic) and D(YRI, X; Gaelic, Icelandic), for all 27 ancient Icelanders (table S15) effectively distinguishes the signature of Iceland-specific drift (an axis parallel to the diagonal line) from Norse-Gaelic ancestry (perpendicular deviation from that line) (Fig. 3A). Consistent with the PCA in Fig. 2A, pre-Christian Icelanders plot close to contemporary Norse and Gaels. Furthermore, KOV-A2 is closer to contemporary Icelanders, whereas FOV-A1 plots roughly halfway between them and the contemporary Gaelic and Norse source populations.
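The geometric reading of this scatterplot can be sketched as a 45-degree rotation of each point. The sketch assumes that the diagonal in Fig. 3A is the line where the two D-statistics are equal, which the text does not state explicitly. The function name and the example values are hypothetical.

```python
import math


def drift_and_ancestry(d_norse, d_gaelic):
    """Rotate a point (D(YRI, X; Norse, Icelandic), D(YRI, X; Gaelic, Icelandic)).

    Returns (along, perp): 'along' is the position parallel to the diagonal,
    read here as Iceland-specific drift, and 'perp' is the signed deviation
    from the diagonal, read here as Norse versus Gaelic ancestry.
    """
    along = (d_norse + d_gaelic) / math.sqrt(2)
    perp = (d_norse - d_gaelic) / math.sqrt(2)
    return along, perp


# Hypothetical values for one individual, for illustration only.
along, perp = drift_and_ancestry(0.004, 0.006)
print(along, perp)
```

A point lying exactly on the diagonal has a perpendicular component of zero, so it carries no Norse versus Gaelic signal in this reading.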

Five pre-Christian Icelanders (VDP-A5, DAV- 
A9, NNM-A1, SVK-A1 and TGS-A1) fall just out- 
side the space occupied by contemporary Norse 
in Fig. 3A. That these individuals show a stronger 
signal of drift shared with contemporary Iceland- 
ers is also apparent in the results of ADMIXTURE, 
run in supervised mode with three contempo- 
rary reference populations (Norse, Gaelic, and 
Icelandic) (Fig. 3B). The correlation between the 
proportion of Icelandic ancestry from this analysis 
and PC1 in Fig. 2A is |r| = 0.913. One possible
explanation of this result is misclassification of
these individuals as early Icelanders. However,
this is doubtful, as they exhibit pre-Christian
burial features, early radiocarbon dates (DAV-A9,
SVK-A1 and TGS-A1), strontium isotopes inconsistent
with being raised in Iceland (DAV-A9)
(Table 1), and a high proportion (>87%) of Norse
ancestry (DAV-A9, NNM-A1, SVK-A1, and TGS-A1,
Fig. 2C).

Contamination from contemporary Icelanders 
could also affect our results but is negligible in 
all five individuals (table S4). Three factors could 
account for the greater shared drift of the five 
pre-Christian Icelanders with contemporary Ice- 
landers. First, subtle genetic drift in the Norse 
gene pools might have shifted allele frequencies 
during the past 1100 years, making contemporary 
Norse less representative of their ancestors than 
is the case for Gaels. Second, they might originate 
from a subpopulation within Scandinavia that is 
poorly represented by contemporary populations 
or our reference samples. Third, these five indi- 
viduals may have contributed disproportionately 
to the gene pool of contemporary Icelanders. 

If genetic drift or population substructure in 
Scandinavian populations were responsible for 
their outlying positions (Figs. 2A and 3A), then 
the five individuals would also be outliers in a 
PCA generated only using contemporary indi- 
viduals from Scandinavia and the British-Irish 
Isles. However, as the five ancient Icelanders fall 
well within the cluster of contemporary Scandi- 
navians (Fig. 3C), we conclude that they, or close 
relatives, likely contributed more to the con- 
temporary Icelandic gene pool than the other 
pre-Christians. We note that this observation is
consistent with the inference that settlers of 
Norse ancestry had greater reproductive success 
than those of Gaelic ancestry. 

Ancient genomes are key to answering ques- 
tions about the formation and evolution of human 
populations during recent millennia. The settle- 
ment of Iceland occurred around 1100 years ago 
at the height of the Viking age. Our study reveals 
a highly admixed Norse and Gaelic gene pool of 
this founding population that was shaped by 
substantial genetic drift. The resultant founder 
events are one reason why the Icelandic popu- 
lation has contributed so much to the discovery 
of genotype-phenotype associations for rare se- 
quence variants (9). 
Biotechnology Director


(Salary: HK$164,500 - HK$179,850 per month, depending on relevant experience) 


Entry Requirements: 


Candidates are required to have: 

(a) a PhD degree in the field of life science or biotechnology;

(b) a minimum of 10 years’ relevant post-qualification experience at a senior management level in a scientific or corporate setting;

(c) a strong research or technical background in biotechnology and related areas (preferably with recognised international standing in
his/her own specialist field); good scientific and/or business networks; competency in research and development and/or business and
product development; and an understanding of current trends in technology and the market (familiarity with the development and commercial
application of biotechnology in the Mainland will be an advantage); and

(d) strong command of both written and spoken English. Proficiency in written Chinese and spoken Cantonese and Putonghua will be 
an advantage. 


Duties: Biotechnology Director is a non-civil service position responsible for providing expert technology-related advice in biotechnology 
and related fields to the Commissioner for Innovation and Technology. He/She is required to lead a team of professional staff to provide 
assistance to the Innovation and Technology Commission (the Commission) in developing solicitation themes, undertaking project assessment, 
monitoring progress and evaluating the programmes in related fields under the Innovation and Technology Fund and other funding schemes 
administered by the Commission. He/She is also expected to maintain contacts with academia, research institutes, industries and other
relevant bodies, to liaise with and support the activities of other government departments/organizations in the biotechnology field, and
to undertake ad hoc activities related to the Commission’s operation and strategic development in this aspect.


Terms of Appointment: The successful candidate will be appointed on non-civil service agreement terms for three years. 


Fringe Benefits: 

(a) An end-of-contract gratuity may be granted upon satisfactory completion of the contract. The amount of gratuity payable will be the 
sum, which, when added together with the Government’s contribution to the Mandatory Provident Fund scheme, equals 25% of the 
total basic salary drawn during the contract period; and 

(b) vacation leave with leave passage allowance, housing benefits in the form of non-accountable cash allowance and other benefits 
subject to the meeting of the eligibility criteria as stipulated in the regulations relating to the provision of such benefits. 


How to Apply: Candidates should apply in writing with a completed application form [G.F.340 (Rev.3/2013), which can be downloaded from
the Civil Service Bureau’s website (http://www.csb.gov.hk)], and a full curriculum vitae giving a detailed account of employment records
with description of duties and past experience in research/project administration/product development in related fields. Copies of academic 
qualification certificates and record of previous employment should also be provided. Applications should reach the following mailing or 
email address on or before the closing date for application (according to postmark). Candidates who are selected for interview will normally 
receive an invitation for interview in about eight to ten weeks after the closing date for application. Those who are not invited for interview 
may assume that their applications are unsuccessful. Interviews are tentatively scheduled to be conducted in Hong Kong during late August/ 
September 2018 (subject to change at the discretion of the Government without prior notification to applicants). Applicants should make 
themselves available for the interview during that period. 


Application and Enquiries: Asianet Consultants (HK) Limited, 702-705, 7/F, Wilson House, 19 Wyndham Street, Central, Hong Kong. For 
enquiries, please call Mr Peter LIU at (852) 2530 0137 or send email to itc-bd@asianetconsultants.com. 


Closing Date for Application: 15 June 2018 
General Notes: 


(a) As an Equal Opportunities Employer, the Government is committed to eliminating discrimination in employment. The vacancy advertised
is open to all applicants meeting the basic entry requirement irrespective of their disability, sex, marital status, pregnancy, age, family
status, sexual orientation and race.

(b) Non-civil service vacancies are not posts on the civil service establishment. Candidates appointed are not on civil service terms of
appointment and conditions of service. Candidates appointed are not civil servants and will not be eligible for posting, promotion or
transfer to any posts in the Civil Service.

(c) Persons who are not permanent residents of the Hong Kong Special Administrative Region may also apply for the post but will be
appointed only when no suitable and qualified candidates who are permanent residents are available.

(d) Where a large number of candidates meet the specified entry requirements, shortlisting criteria may be devised to select the better
qualified candidates for further processing. In these circumstances, only shortlisted candidates will be invited to attend the interview.

(e) It is Government policy to place people with a disability in appropriate jobs wherever possible. If a disabled candidate meets the entry
requirements, he/she will be invited to attend the selection interview without being subject to any further shortlisting criteria.

(f) The entry pay, terms of appointment and conditions of service to be offered are subject to the provisions prevailing at the time the offer
of appointment is made.

(g) Holders of academic qualifications other than those obtained from Hong Kong institutions/Hong Kong Examinations and Assessment
Authority may also apply but their qualifications will be subject to assessments on equivalence with the required entry qualifications.
They should submit copies of their official transcripts and certificates by post or email to the above mailing and email address.

(h) Non-civil service vacancy information contained in this advertisement is also available on GovHK at http://www.gov.hk
and on the Innovation and Technology Commission homepage at http://www.itc.gov.hk.
WORKING LIFE


Finding a community in the lab


By Nathan Bradshaw


It is a rite of passage for every college student: English 101. The class began with introductions. At 28 years old, I knew that I would be older than the average student. After almost failing out of high school, I had enlisted in the U.S. Navy and spent the next 8 years operating the nuclear reactor on an aircraft carrier. But I was not prepared for the six precocious high school students who were in the community college class with me, getting some exposure to higher education and earning college credit. They hammered home the feeling I had been trying to keep at bay: I was too old to be starting this journey. I only returned for the next class because the Department of Veterans Affairs was paying me a living allowance while I attended college. But when my studies led me to my first research lab, I found some much needed common ground.


I had started college with plans 
to become a pharmacist. In my 
introductory chemistry and biology 
courses, I found other “nontradi- 
tional” students—older students, 
veterans, students with kids, stu- 
dents who were also working full- 
time jobs—who grew into a cohort 
for me. The research I conducted 
as part of the class was challeng- 
ing, but it offered an opportunity 
to apply some of the skills I had 
learned in the Navy, reassuring 
me that my experience had value. 
I enjoyed research so much that, 
before I completed my first year of 
college, I decided that I would go 
on to pursue a Ph.D. 

When I transferred to a 4-year 
university, I regularly felt like an 
outsider among students 10 years 
younger than me with limited life 
experience. But again, research provided a place where 
I felt I belonged. When I started working in a lab on a 
project developing artificial muscles, a graduate student 
demonstrated how the muscles move when connected to 
an instrument called a Keithley source meter. The student 
had focused on synthesizing the muscles and wasn’t very 
familiar with the meter, which both generates and mea- 
sures voltage and current. I, on the other hand, had spent 
more than 6 years using an older version of this instru- 
ment for reactor maintenance in the Navy. I immediately 
felt at home. I could do this. 

My undergraduate research adviser gave me free rein to
pursue new directions with the meter and design my own
experiments. This was just one way that she encouraged
me to capitalize on my experience and helped me learn to
be an undergraduate in my 30s. Her support helped me
see college as a continuation of my career instead of
starting over. The years I spent in the Navy became
an asset instead of a gulf between me and my peers.

I’m now pursuing my Ph.D., and I 
feel lucky to have found colleagues 
and mentors who make me feel that 
I belong. But I know that this is not 
everyone’s experience, and that
academia and labs can be alienat- 
ing places. So, I do what I can to 
help foster a welcoming, supportive 
environment for the other research- 
ers around me. I regularly share the 
difficulties I faced in school and lab 
with junior members of the lab to 
help them understand that every- 
one struggles. I also co-founded a 
speaker series where visiting pro- 
fessors share stories about their 
own roots in underrepresented 
communities or their efforts to reach out to other groups. 
My goal is to help graduate students see the humans behind 
the science and connect with the speakers at a deeper level. 

These are just small steps. But they are based on the les- 
son that I learned through my years of struggling to find 
my place in academia: Community doesn’t depend on age 
or background, but on appreciating one another’s perspec- 
tives and skills and acknowledging that, despite differences, 
we face some common struggles. Taking the time and effort 
to build connections with other members of your lab and 
research community can help students of all backgrounds 
find success. It certainly did for me. 


Nathan Bradshaw is a graduate student at Northwestern
University in Evanston, Illinois. Do you have an interesting
career story? Send it to SciCareerEditor@aaas.org.